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METHODS AND PRODUCTS FOR ANALYZING POLYMERS 
Background 

The study of molecular and cellular biology is focused on the macroscopic structure 
of cells. We now know that cells have a complex micro structure that determine the 
functionality of the cell. Much of the diversity associated with cellular structure and function 
is due to the ability of a cell to assemble various building blocks into diverse chemical 
compounds. The cell accomplishes this task by assembling polymers from a limited set of 
building blocks referred to as monomers. The key to the diverse functionality of polymers is 
based in the primary sequence of the monomers within the polymer and is integral to 
understanding the basis for cellular function, such as why a cell differentiates in a particular 
manner or how a cell will respond to treatment with a particular drug. 

The ability to identify the structure of polymers by identifying their sequence of 
monomers is integral to the understanding of each active component and the role that 
component plays within a cell. By determining the sequences of polymers it is possible to 
generate expression maps, to determine what proteins are expressed, to understand where 
mutations occur in a disease state, and to determine whether a polysaccharide has better 
function or loses function when a particular monomer is absent or mutated. 

Expression maps relate to determining mRNA expression patterns. The need to 
identify differentially expressed mRNAs is critical in the understanding of genetic 
programming, both temporally and spatially. Different genes are turned on and off during the 
temporal course of an organisms' life development, comprising embryonic, growth, and 
aging stages. In addition to developmental changes, there are also temporal changes in 
response to varying stimuli such as injury, drugs, foreign bodies, and stress. The ability to 
chart expression changes for specific sets of cells in time either in response to stimuli or in 
growth allows the generation of what are called temporal expression maps. On the other 
hand, there are also body expression maps, which include knowledge of differentially 
expressed genes for different tissues and cell types. Expression maps are different not only 
between species and between individuals, but also between diseased and disease-free states. 
Examination of differential gene expression has yielded key discoveries of genes in widely 
varying disciplines, such as signal transduction (Smith et al., 1990), circadian rhythms (Loros 
et al., 1989), fruit ripening (Wilinson et al., 1995), hunger (Qu et al., 1996), cell cycle control 
(el-Deiry et al., 1993), apoptosis (Woronicz et al., 1994), and ischemic injury (Wang et al., 
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1995), among many others. Since generation of expression maps involve the sequencing and 
identification of cDNA or mRNA, more rapid sequencing necessarily means more rapid 
generation of multiple expression maps. 

Currently, only 1% of the human genome and an even smaller amount of other 
5 genomes have been sequenced. In addition, only one very incomplete human body 
expression map using expressed sequence tags has been achieved (Adams et al., 1995). 
Current protocols for genomic sequencing are slow and involve laborious steps such as 
cloning, generation of genomic libraries, colony picking, and sequencing. The time to create 
even one partial genomic library is on the order of several months. Even after the 

10 establishment of libraries, there are time lags in the preparation of DNA for sequencing and 
the running of actual sequencing steps. Given the multiplicative effect of these unfavorable 
facts, it is evident that the sequencing of even one genome requires an enormous investment 
of money, time, and effort. 

In general DNA sequencing is currently performed using one of two methods. The 

15 first and more popular method is the dideoxy chain termination method described by Sanger 
et al. (1977). This method involves the enzymatic synthesis of DNA molecules terminating 
in dideoxynucleotides. By using the four ddNTPs, a population of molecules terminating at 
each position of the target DNA can be synthesized. Subsequent analysis yields information 
on the length of the DNA molecules and the base at which each molecule terminates (either 

20 A, C, G, or T). With this information, the DNA sequence can be determined. The second 
method is Maxam and Gilbert sequencing (Maxam and Gilbert, 1977), which uses chemical 
degradation to generate a population of molecules degraded at certain positions of the target 
DNA. With knowledge of the cleavage specificities of the chemical reactions and the lengths 
of the fragments, the DNA sequence is generated. Both methods rely on polyacrylamide gel 

25 electrophoresis and photographic visualization of the radioactive DNA fragments. Each 
process takes about 1-3 days. The Sanger sequencing reactions can only generate 300-800 
bases in one run. 

Methods to improve the output of sequence information using the Sanger method also 
have been proposed. These Sanger-based methods include multiplex sequencing, capillary 
30 gel electrophoresis, and automated gel electrophoresis. Recently, there has also been 

increasing interest in developing Sanger independent methods as well. Sanger independent 
methods use a completely different methodology to realize the base information. This 
category contains the most novel techniques, which include scanning electron microscopy 
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(STM), mass spectrometry, enzymatic luminometric inorganic pyrophosphate detection assay 
(ELIDA) sequencing, exonuclease sequencing, and sequencing by hybridization. A brief 
summary of these methods is set forth below. 

Currently, automated gel electrophoresis is the most widely used method of large- 
5 scale sequencing. Automation requires reading of fluorescently labeled Sanger fragments in 
real time with a charge coupled device (CCD) detector. The four different dideoxy chain 
termination reactions are run with different labeled primers. The reaction mixtures are 
combined and co-electrophoresed down a slab of polyacrylamide. Using laser excitation at 
the end of the gel, the separated DNA fragments are resolved and the sequence determined by 
10 computer. Many automated machines are available commercially, each employing different 
detection methods and labeling schemes. The most efficient of these is the Applied 
Biosystems Model 377XL, which generates a maximum actual rate of 1 15,200 bases per day. 

In the method of capillary gel-electrophoresis, reaction samples are analyzed by small 
diameter, gel-filled capillaries. The small diameter of the capillaries (50 urn) allows for 
15 efficient dissipation of heat generated during electrophoresis. Thus, high field strengths can 
be used without excessive Joule heating (400 V/m), lowering the separation time to about 20 
minutes per reaction run. Not only are the bases separated more rapidly, there is also 
increased resolution over conventional gel electrophoresis. Furthermore, many capillaries are 
analyzed in parallel (Wooley and Mathies, 1995), allowing amplification of base information 
20 generated (actual rate is equal to 200,000 bases/day). The main drawback is that there is not 
continuous loading of the capillaries since a new gel-filled capillary tube must be prepared 
for each reaction. Capillary gel electrophoresis machines have recently been 
commercialized. 

Multiplex sequencing is a method which more efficiently uses electrophoretic gels 
25 (Church and Kieffer-Higgins, 1988). Sanger reaction samples are first tagged with unique 
oligomers and then up to 20 different samples are run on one lane of the electrophoretic gel. 
The samples are then blotted onto a membrane. The membrane is then sequentially probed 
with oligomers that correspond to the tags on the Sanger reaction samples. The membrane is 
washed and reprobed successively until the sequences of all 20 samples are determined. 
30 Even though there is a substantial reduction in the number of gels run, the washing and 
hybridizing steps are as equally laborious as running electrophoretic gels. The actual 
sequencing rate is comparable to that of automated gel electrophoresis. 
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Sequencing by mass spectrometry was first introduced in the late 80 's. Recent 
developments in the field have allowed for better sequence determination (Crain, 1990; Little 
et al., 1994; Keough et al., 1993; Smirnov et al., 1996). Mass spectrometry sequencing first 
entails creating a population of nested DNA molecules that differ in length by one base. 
5 Subsequent analysis of the fragments is performed by mass spectrometry. In one example, an 
exonuclease is used to partially digest a 33-mer (Smirnov, 1996). A population of molecules 
with similar 5' ends and varying points of 3' termination is generated. The reaction mixture is 
then analyzed. The mass spectrometer is sensitive enough to distinguish mass differences 
between successive fragments, allowing sequence information to be generated. 

10 Mass spectrometry sequencing is highly accurate, inexpensive, and rapid compared to 

conventional methods. The major limitation, however, is that the read length is on the order 
of tens of bases. Even the best method, matrix-assisted laser desorption ionization time-of- 
flight (MALDI-TOF) mass spectroscopy (Smirnov et al., 1996), can only achieve maximum 
read lengths of 80-90 base pairs. Much longer read lengths are physically impossible due to 

15 fragmentation of longer DNA at guanidines during the analysis step. Mass spectrometry 
sequencing is thus limited to verifying short primer sequences and has no practical 
application in large-scale sequencing. 

The Scanning tunneling microscope (STM) sequencing (Ferrell, 1997) method was 
conceived at the time the STM was commercially available. The initial promise of being able 

20 to read base-pair information directly from the electron micrographs no longer holds true. 
DNA molecules must be placed on conducting surfaces, which are usually highly ordered 
pyrolytic graphite (HOPG) or gold. These lack the binding sites to hold DNA strongly 
enough to resist removal by the physical and electronic forces exerted by the tunneling tip. 
With difficulty, DNA molecules can be electrostatically adhered to the surfaces. Even with 

25 successful immobilization of the DNA, it is difficult to distinguish base information because 
of the extremely high resolutions needed. With current technology, purines can be 
distinguished from pyrimidines, but the individual purines and pyrimidines cannot be 
identified. The ability to achieve this feat requires electron microscopy to be able to 
distinguish between aldehyde and amine groups on the purines and the presence or absence of 

30 methyl groups on the pyrimidines. 

Enzymatic luminometric inorganic pyrophosphate detection assay (ELIDA) 
sequencing uses the detection of pyrophosphate release from DNA polymerization to 
determine the addition of successive bases. The pyrophosphate released by the DNA 



-5- 



polymerization reaction is converted to ATP by ATP sulfurylase and the ATP production is 
monitored continuously by firefly luciferase. To determine base specificity, the method uses 
successive washes of ATP, CTP, GTP, and TTP. If a wash for ATP generates 
pyrophosphate, one or more adenines are incorporated. The number of incorporated bases is 
5 directly proportional to the amount of pyrophosphate generated. Enhancement of generated 
sequence information can be accomplished with parallel analysis of many ELIDA reactions 
simultaneously. 

The main disadvantage is the short read length. Ronaghi et al. (1996) have only 
achieved a maximum read length of 1 5 bases because of the multiple washings needed. Since 

10 there are four washes per base read, this means that a total of 400 washes mush be performed 
for a read length of a hundred bases. If there is even 1% loss of starting material for each 
wash, after 400 washes there would be 1.8% of the starting material remaining, which is 
insufficient for detection. 

Exonuclease sequencing involves a fluorescently labeled, single-stranded DNA 

15 molecule which is suspended in a flowing stream and sequentially cleaved by an exonuclease. 
Individual fluorescent bases are then released and passed through a single molecule detection 
system. The temporal sequence of labeled nucleotide detection corresponds to the sequence 
of the DNA (Ambrose et al., 1993; Davis et al., 1992; Jett et al., 1989). Using a processive 
exonuclease, it theoretically is possible to sequence 10,000 bp or larger fragments at a rate of 

20 10 bases per second. 

In practice, exonuclease sequencing has encountered many difficulties in each of the 
steps. The labeling step requires that all four bases in the DNA be tagged with different 
fluorophores. Sterically, this is extremely unfavorable. Ambrose et al., 1993 has achieved 
complete labeling of two bases on a 7 kb strand of Ml 3 DNA. Furthermore, difficult optical 

25 trapping is needed to suspend DNA molecules in a flowing stream. The step is time intensive 
and requires considerable expertise. Lastly, single molecules of fluorophore need to be 
detected with high efficiency. Even a 1% error is significant. Improvements in detection 
from 65% to 95% efficiency have been achieved. The efficiency of detection has been 
pushed to the limit and it would be difficult to achieve further improvements. 

30 In the sequencing by hybridization method, a target DNA is sequentially probed with 

a set of oligomers consisting of all the possible oligomer sequences. The sequence of the 
target DNA is generated with knowledge of the hybridization patterns between the oligomers 
and the target (Bains, 1991; Cantor et al, 1992; Drmanac et al, 1994). There are two 
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possible methods of probing target DNA. The "Probe Up" method includes immobilizing the 
target DNA on a substrate and probing successively with a set of oligomers. "Probe Down" 
on the other hand requires that a set of oligomers be immobilized on a substrate and 
hybridized with the target DNA. With the advent of the "DNA chip," which applies 
5 microchip synthesis techniques to DNA probes, arrays of thousands of different DNA probes 
can be generated on a 1 cm 2 area, making Probe Down methods more practical. Probe Up 
methods would require, for an 8-mer, 65,536 successive probes and washings, which would 
take an enormous amount of time. On the other hand, Probe Down hybridization generates 
data in a few seconds. With perfect hybridization, 65,536 octamer probes would determine a 

10 maximum of 1 70 bases. With 65,536 "mixed" 1 1 -mers, 700 bases can be generated. 

In practice, Probe Up methods have been used to generate sequences of about 100 
base pairs. Imperfect hybridization has led to difficulties in generating adequate sequence. 
Error in hybridization is amplified many times. A 1% error rate reduces the maximum length 
that can be sequenced by at least 10%. Thus if 1% of 65,536 oligonucleotides gave false 

15 positive hybridization signals when hybridizing to a 200-mer DNA target, 75% of the scored 
"hybridizations" would be false (Bains, 1997). Sequence determination would be impossible 
in such an instance. The conclusion is that hybridization must be extremely effective in order 
to generate reasonable data. Furthermore, sequencing by hybridization also encounters 
problems when there are repeats in sequences that are one base less than the length of the 

20 probe. When such sequences are present, multiple possible sequences are compatible with 
the hybridization data. 

The most common limitation of most of these techniques is a short read length. In 
practice a short read length means that additional genetic sequence information needs to be 
sequenced before the linear order of a target DNA can be deciphered. The short fragments 

25 have to be bridged together with additional overlapping fragments. Theoretically, with a 500 
base read length, a minimum of 9 x 10 9 bases need to be sequenced before the linear 
sequence of all 3 x 10 9 bases of the human genome are properly ordered. In reality, the 
number of bases needed to generate a believable genome is approximately 2 x 10 10 bases. 
Comparisons of the different techniques show that only the impractical exonuclease 

30 sequencing has the theoretical capability of long read lengths. The other methods have short 
theoretical read lengths and even shorter realistic read lengths. To reduce the number of 
bases that need to be sequenced, it is clear that the read length must be improved. 
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Protein sequencing generally involves chemically induced sequential removal and 
identification of the terminal amino acid residue, e.g., by Edman degradation. See Stryer, L., 
Biochemistry, W. H. Freeman and Co., San Francisco (1981) pp. 24-27. Edman degradation 
requires that the polypeptide have a free amino group which is reacted with an 
5 isothiocyanate. The isothiocyanate is typically phenyl isothiocyanate. The adduct 
intramolecularly reacts with the nearest backbone amide group of the polymer thereby 
forming a five membered ring. This adduct rearranges and the terminal amino acid residue is 
then cleaved using strong acid. The released phenylthiohydantoin (PTH) of the amino acid is 
identified and the shortened polymer can undergo repeated cycles of degradation and 
10 analysis. 

Further, several new methods have been described for carboxy terminal sequencing of 
polypeptides. See Inglis, A. S., Anal. Biochem. 195:183-96 (1991). Carboxy terminal 
sequencing methods mimic Edman degradation but involve sequential degradation from the 
opposite end of the polymer. See Inglis, A. S., Anal. Biochem. 195:183-96 (1991). Like 

1 5 Edman degradation, the carboxy-terminal sequencing methods involve chemically induced 
sequential removal and identification of the terminal amino acid residue. 

More recently, polypeptide sequencing has been described by preparing a nested set 
(sequence defining set) of polymer fragments followed by mass analysis. See Chait, B. T. et 
al., Science 257:1885-94 (1992). Sequence is determined by comparing the relative mass 

20 difference between fragments with the known masses of the amino acid residues. Though 
formation of a nested (sequence defining) set of polymer fragments is a requirement of DNA 
sequencing, this method differs substantially from the conventional protein sequencing 
method consisting of sequential removal and identification of each residue. Although this 
method has potential in practice it has encountered several problems and has not been 

25 demonstrated to be an effective method. 

Each of the known methods for sequencing polymers has drawbacks. For instance 
most of the methods are slow and labor intensive. The gel based DNA sequencing methods 
require approximately 1 to 3 days to identify the sequence of 300-800 units of a polymer. 
Methods such as mass spectroscopy and ELIDA sequencing can only be performed on very 

30 short polymers. 

An enormous need exists for de noveau polymer sequence determination. The rate of 
sequencing has limited the capability to generate multiple body and temporal expression 
maps which would undoubtedly aid the rapid determination of complex genetic function. A 
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need also exists for improved methods for analyzing polymers in order to speed up the rate at 
which diagnosis of diseases and preparation of new medicines is carried out. 



Summary of the Invention 

5 The invention relates to new methods and products for analyzing polymers and in 

particular new methods and products useful for determining the sequence of polymers. The 
invention has surprising advantages over prior art methods used to sequence polymers. Prior 
to the present invention no method or combination of methods has come close to achieving 
the rate of sequencing which the instant invention is capable of achieving. Using the methods 

10 of the invention the entire human genome can be sequenced several orders of magnitude 

faster than could be accomplished using conventional technology. In addition to sequencing 
the entire genome, the methods and products of the invention can be used to create 
comprehensive and multiple expression maps for developmental and disease processes. The 
ability to sequence an individual's genome and to generate multiple expression maps will 

15 greatly enhance the ability to determine the genetic basis of any phenotypic trait or disease 
process. 

The method for analyzing polymers according to the invention is based on the ability 
to examine each unit of a polymer individually. By examining each unit individually the type 
of unit and the position of the unit on the backbone of the polymer can be identified. This 

20 can be accomplished by positioning a unit at a station and examining a change which occurs 
when that unit is proximate to the station. The change can arise as a result of an interaction 
that occurs between the unit and the station or a partner and is specific for the particular unit. 
For instance if the polymer is a nucleic acid molecule and a T is positioned in proximity to a 
station a change which is specific for a T occurs. If on the other hand, a G is positioned in 

25 proximity to a station then a change which is specific for a G will occur. The specific change 
which occurs depends on the station used and the type of polymer being studied. For 
instance the change may be an electromagnetic signal which arises as a result of the 
interaction. 

The methods of the invention broadly encompass two types of methods for analyzing 
30 polymers by identifying a unit (or in some cases a group of units) within a polymer. The first 
type of method involves the analysis of at least a single polymer. An individual unit of the 
single polymer in one aspect is caused to interact with an agent such that a change, e.g., 
energy transfer or quenching occurs and produces a signal. The signal is indicative of the 
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identity of the unit. In another aspect an individual unit is exposed to a station resulting in a 
detectable physical change to the unit or station. The change in the unit or station produces a 
signal which can be detected and is characteristic of that particular unit. The second type of 
method involves the analysis of a plurality of polymers. A unit of each of the plurality of 
5 polymers is positioned at a station where an interaction can occur. The interaction is one 
which produces a polymer dependent impulse that specifically identifies the unit. The 
polymer dependent impulse may arise from, for example, energy transfer, quenching, 
changes in conductance, mechanical changes, resistance changes, or any other physical 
change. 

10 The proposed method for analyzing polymers is particularly useful for determining 

the sequence of units within a DNA molecule and can eliminate the need for generating 
genomic libraries, cloning, and colony picking, all of which constitute lengthy pre- 
sequencing steps that are major limitations in current genomic-scale sequencing protocols. 
The methods disclosed herein provide much longer read lengths than achieved by the prior art 

15 and a million-fold faster sequence reading. The proposed read length is on the order of 
several hundred thousand nucleotides. This translates into significantly less need for 
overlapping and redundant sequences, lowering the real amount of DNA that needs to be 
sequenced before genome reconstruction is possible. 

Methods for preparing polymers for analysis are also claimed herein. The 

20 combination of the long read length and the novel preparation methods results in a much 

more stream-lined and efficient process. Lastly, the actual time taken to read a given number 
of units of a polymer is a million-fold more rapid than current methods because of the 
tremendous parallel amplification supplied by a novel apparatus also claimed herein, which is 
referred to as a nanochannel plate or a microchannel plate. The combination of all these 

25 factors translates into a method of polymer analysis including sequencing that will provide 
enormous advances in the field of molecular and cell biology. 

The ability to sequence polymers such as genomic DNA by the methods described in 
the instant invention will have tremendous implications in the biomedical sciences. The 
recovery of genetic data at such a rapid pace will advance the Human Genome Project. The 

30 methods and products of the invention will allow the capability to prepare multiple 

expression maps for each individual, allowing complete human genetic programs to be 
deciphered. The ability to compare pools of individual genetic data at one time will allow, 
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for the first time, the ability to discover not only single gene diseases with ease, but also 
complex multigene disorders as rapidly as the DNA itself is sequenced. 

In one aspect the invention is a method for analyzing a polymer of linked units. The 
method involves the steps of exposing a plurality of individual units of a polymer to an agent 
selected from the group consisting of electromagnetic radiation, a quenching source and a 
fluorescence excitation source, individual units interacting with the agent to produce a 
detectable signal, and detecting sequentially the signals resulting from said interaction to 
analyze the polymer. In one embodiment the signal is electromagnetic radiation. In another 
embodiment the agent is electromagnetic radiation. According to an embodiment of the 
invention individual units of the polymer are labeled with a fluorophore. 

The plurality of individual units of the polymer may be sequentially exposed to 
electromagnetic radiation by bringing the plurality of individual units in proximity to a light 
emissive compound and exposing the light emissive compound to electromagnetic radiation, 
and wherein the plurality of individual units of the polymer detectably affect emission of 
electromagnetic radiation from the light emissive compound. In another embodiment the 
plurality of individual units of the polymer are sequentially exposed to electromagnetic 
radiation, and wherein the electromagnetic radiation detectably affects emission of 
electromagnetic radiation from the plurality of individual units of the polymer to produce the 
detectable signal. 

According to another embodiment of the invention the method involves the step of 
moving the polymer through a nanochannel in a wall material in order to locate the detectable 
signal. The plurality of individual units of the polymer are sequentially exposed to the agent 
by moving the polymer through a nanochannel in a wall material and exposing the plurality 
of individual units of the polymer to the agent at an interaction station at the nanochannel. 
The agent can be attached to (embedded in, covalently attached to the surface of or coated on 
the surface of) the wall material. In one embodiment the wall material includes a plurality of 
nanochannels, an interaction station at each nanochannel, and a plurality of polymers is 
moved through said nanochannel, only one polymer passes the interaction station at any 
given time (more than one polymer may be in a single nanochannel at a given time as long as 
they do not overlap), and signals resulting from the interaction of individual units of the 
polymers and the agent at the interaction stations are detected simultaneously. Preferably the 
nanochannel is fixed in the wall material. 
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The signals which are detected can be stored in a database for further analysis. In one 
method of analysis these signals can be compared to a pattern of signals from another 
polymer to determine the relatedness of the two polymers. Alternatively the detected signals 
can be compared to a known pattern of signals characteristic of a known polymer to 
5 determine the relatedness of the polymer being analyzed to the known polymer. The analysis 
may also involve measuring the length of time elapsed between detection of a first signal 
from the first unit and a second signal from a second unit. In one embodiment the plurality of 
individual units are two units, a first unit at a first end of the polymer and a second unit at an 
opposite second end of the polymer. The time elapsed between the sequential detection of 
10 signals may indicate the distance between two units or the length of the polymer. 

The polymer may be any type of polymer known in the art. In a preferred 
embodiment the polymer is selected from the group consisting of a nucleic acid and a protein. 
In a more preferred embodiment the polymer is a nucleic acid. 

The units of the polymer which interact with the agent to produce a signal are labeled. 
15 The units may be intrinsically labeled or extrinsically labeled. In one embodiment only a 
portion of the units of the polymer are labeled. In another embodiment all of the units are 
labeled. In yet another embodiment at least two units of the polymer are labeled differently 
so as to produce two different detectable signals. The units of the polymer may be labeled 
such that each unit or a specified portion of the units is labeled or it may be randomly labeled. 
20 In another embodiment the plurality of individual units of the polymer are exposed to 

at least two stations positioned in distinct regions of the channel, wherein the interaction 
between the units of the polymer and the at least two stations produces at least two signals. 

In one embodiment the individual unit of the polymer is labeled with radiation and the 
signal is electromagnetic radiation in the form of fluorescence. 
25 In another embodiment the unit is exposed to the agent at a station. Preferably the 

station is a non-liquid material. 

In yet another embodiment the plurality of individual units of the polymer are 
exposed to at least two agents and the interaction between the units of the polymer and the at 
least two agents produces at least two signals. The at least two agents may be positioned in 
30 distinct regions of a channel through which the polymer passes. In one embodiment the at 
least two signals are different signals. In another embodiment the at least two signals are the 
same signals. 
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According to another aspect of the invention a method for analyzing a polymer of 
linked units is provided. The method involves the steps of moving a plurality of individual 
units of a polymer of linked units with respect to a station and detecting sequentially signals 
arising from a detectable physical change in the polymer or the station as individual units 
5 pass the station to analyze the polymer. This aspect of the invention also encompasses each 
of the embodiments discussed above. 

In one embodiment the station is an interaction station and the individual units are 
exposed at the interaction station to an agent that interacts with the individual unit to produce 
a detectable electromagnetic radiation signal characteristic of the interaction. In another 
10 embodiment the station is a signal generation station and the characteristic signal produced is 
a polymer dependent impulse. Preferably the station is a non-liquid material. 

In another aspect the invention is a method for analyzing a polymer of linked units by 
exposing a plurality of individual units of a polymer to a station to produce to produce a non- 
ion conductance signal resulting from the exposure of the units of the polymer to the station, 
15 and wherein the station is attached to a wall material having a surface defining a channel. 
This aspect of the invention also encompasses each of the embodiments discussed above. 

According to another aspect of the invention a method for identifying an individual 
unit of a polymer is provided. The method involves the steps of transiently exposing the 
individual unit of the polymer to an agent selected from the group consisting of 
20 electromagnetic radiation, a quenching source and a fluorescence excitation source, the 
identity of the individual unit being unknown, to generate an interaction with a detectable 
electromagnetic radiation signal characteristic of said individual unit, detecting said signal, 
and distinguishing said signal from signals generated from adjacent signal generating units of 
the polymer as an indication of the identity of the individual unit. 
25 The agent can be one or more fluorophores and preferably is at least three 

fluorophores. When the individual unit is transiently exposed to one or more fluorophores 
(agent) by positioning the individual unit within energy transfer proximity of the agent, 
fluorescence energy transfer occurs between the agent and the individual unit. The signal is 
detected by detecting the fluorescence energy transfer. 
30 In one embodiment the individual unit of the polymer is exposed to the agent by 

positioning the individual unit at an interaction station comprising a nanochannel in a wall 
material. Preferably the wall material comprises at least two layers, one of the layers 
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allowing signal generation and the other preventing signal generation and the nanochannel 
traverses both layers. 

According to another aspect the invention is a method for identifying an individual 
unit of a polymer. The method includes the steps of transiently moving the individual unit of 
the polymer relative to a station, the identity of the individual unit being unknown, detecting 
a signal arising from a detectable physical change in the unit or the station, and distinguishing 
said signal from signals arising from exposure of adjacent signal generating units of the 
polymer to the station as an indication of the identity of the individual unit. This aspect of 
the invention also encompasses each of the embodiments discussed above. 

In one embodiment the station is an interaction station and the individual units are 
exposed at the interaction station to an agent that interacts with the individual unit to produce 
a detectable electromagnetic radiation signal characteristic of the interaction. In another 
embodiment the station is a signal generation station and the characteristic signal produced is 
a polymer dependent impulse. 

In yet another aspect the invention is a method for determining the proximity of two 
individual units of a polymer of linked units. The method includes the steps of moving the 
polymer relative to a station, exposing individual units to the station to produce a 
characteristic signal arising from a detectable physical change in the unit or the station, 
detecting characteristic signals generated, and measuring the amount of time elapsed between 
detecting characteristic signals, the amount of time elapsed being indicative of the proximity 
of the two individual units. 

In one embodiment the station is an interaction station. In another embodiment the 
interaction station includes an agent and the agent is selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source and the 
characteristic signal is a detectable electromagnetic radiation signal. In another embodiment 
the interaction station is a nanochannel in a wall material. 

In certain other embodiments the station referred to is a signal generation station. In 
another embodiment the signal generation station includes a physical impulse source which 
interacts with the polymer to produce a characteristic signal which is a physical impulse. The 
physical impulse in one embodiment arises from a change in a physical quantity such as 
resistance or conductance as a result of the exposure of the physical impulse source to the 
unit of the polymer. In one embodiment the physical impulse arises from changes in 
capacitance or resistance caused by the movement of the unit between microelectrodes or 
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nanoelectrodes positioned adjacent to the polymer unit. For instance the signal generation 
station may include microelectrodes or nanoelectrodes positioned on opposite sides of the 
polymer unit. The changes in resistance or conductance which occur as a result of the 
movement of the unit past the electrodes will be specific for the particular unit. In another 
embodiment the physical impulse arises from a release of radioactive signal from the unit. In 
other embodiments it arises from piezoelectric tip, direct physical contact, and NMR-nuclear 
spin signal. 

The polymer may be any type of polymer known in the art. In a preferred 
embodiment the polymer is selected from the group consisting of a nucleic acid and a protein. 
In a more preferred embodiment the polymer is a nucleic acid. The polymers can be labeled, 
randomly or non randomly. Different labels can be used to label different linked units to 
produce different signals. In one embodiment the individual units of the polymer are labeled 
with a fiuorophore. 

A method for determining the order of two individual units of a polymer of linked 
units is provided in another aspect of the invention. The method involves the steps of moving 
the polymer linearly with respect to a station, exposing one of the individual units to the 
station to produce a signal arising from a detectable physical change in the unit or the station, 
exposing the other of the individual units to the station to produce a second detectable signal 
arising from a detectable physical change in the unit or the station, different from the first 
signal, and determining the order of the signals as an indication of the order of the two 
individual units. 

In one embodiment the station is an interaction station. In another embodiment the 
interaction station includes an agent and the agent is selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source and the 
characteristic signals produced are detectable electromagnetic radiation signals. In another 
embodiment the interaction station is a nanochannel in a wall material. 

In certain other embodiments the station referred to is a signal generation station. In 
another embodiment the signal generation station includes a physical impulse source which 
interacts with the polymer to produce a characteristic signal which is a physical impulse. The 
physical impulse in one embodiment arises from a change in a physical quantity such as 
resistance or conductance as a result of the exposure of the physical impulse source to the 
unit of the polymer. In one embodiment the physical impulse arises from changes in 
capacitance or resistance caused by the movement of the unit between microelectrodes or 
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nanoelectrodes positioned adjacent to the polymer unit. For instance the signal generation 
station may include microelectrodes or nanoelectrodes positioned on opposite sides of the 
polymer unit. The changes in resistance or conductance which occur as a result of the 
movement of the unit past the electrodes will be specific for the particular unit. In another 
5 embodiment the physical impulse arises from a release of radioactive signal from the unit. In 
other embodiments it arises from piezoelectric tip, direct physical contact, and NMR-nuclear 
spin signal. 

The polymer may be any type of polymer known in the art. In a preferred 
embodiment the polymer is selected from the group consisting of a nucleic acid and a protein. 
10 In a more preferred embodiment the polymer is a nucleic acid. The polymers can be labeled, 
randomly or non randomly. Different labels can be used to label different linked units to 
produce different signals. In one embodiment the individual units of the polymer are labeled 
with a fluorophore. In another embodiment the individual units of the polymer are labeled 
with radioactivity. 

15 According to yet another aspect of the invention a method for determining the 

distance between two individual units of a polymer of linked units is provided. The method 
involves the steps of causing the polymer to pass linearly relative to a station, detecting a 
characteristic signal generated as each of the two individual units passes by the station, 
measuring the time elapsed between the signals measured, repeating steps 1, 2 and 3 for a 

20 plurality of similar polymers to produce a data set, and determining the distance between the 
two individual units based upon the information obtained from said plurality of similar 
polymers by analyzing the data set. 

In one embodiment the station is an interaction station. In another embodiment the 
interaction station includes an agent and the agent is selected from the group consisting of 

25 electromagnetic radiation, a quenching source and a fluorescence excitation source. In 
another embodiment the characteristic signals produced are detectable electromagnetic 
radiation signals. In another embodiment the interaction station is a nanochannel in a wall 
material. 

In certain other embodiments the station referred to is a signal generation station. In 
30 another embodiment the signal generation station includes a physical impulse source which 
interacts with the polymer to produce a characteristic signal which is a physical impulse. The 
physical impulse in one embodiment arises from a change in a physical quantity such as 
resistance or conductance as a result of the exposure of the physical impulse source to the 
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unit of the polymer. In one embodiment the physical impulse arises from changes in 
capacitance or resistance caused by the movement of the unit between microelectrodes or 
nanoelectrodes positioned adjacent to the polymer unit. For instance the signal generation 
station may include microelectrodes or nanoelectrodes positioned on opposite sides of the 
5 polymer unit. The changes in resistance or conductance which occur as a result of the 
movement of the unit past the electrodes will be specific for the particular unit. In another 
embodiment the two linked units are detected at the signal generation station by measuring 
light emission at the station. In another embodiment the physical impulse arises from a 
release of radioactive signal from the unit. In other embodiments it arises from piezoelectric 

10 tip, direct physical contact, and NMR-nuclear spin signal. 

The polymer may be any type of polymer known in the art. In a preferred 
embodiment the polymer is selected from the group consisting of a nucleic acid and a protein. 
In a more preferred embodiment the polymer is a nucleic acid. The polymers can be labeled, 
randomly or non randomly. Different labels can be used to label different linked units to 

15 produce different signals. In one embodiment the individual units of the polymer are labeled 
with a fluorophore. 

According to another embodiment the plurality of similar polymers is a homogeneous 
population. In another embodiment the plurality of similar polymers is a heterogenous 
population. 

20 In another embodiment steps (1) - (4) are carried out substantially simultaneously. 

According to yet another aspect of the invention a method for detecting resonance 
energy transfer or quenching between two interactive partners capable of such transfer or 
quenching is disclosed. The method involves the steps of bringing the two partners in close 
enough proximity to permit such transfer or quenching, applying an agent to one of said 

25 partners, the agent selected from the group consisting of electromagnetic radiation, a 

quenching source and a fluorescence excitation source, shielding fluorescence resonance 
energy transfer and quenching occurring from electromagnetic radiation emission and 
interaction between said partners with a material shield, and detecting the emitted 
electromagnetic radiation. In a preferred embodiment the material shield is a conductive 

30 material shield. 

In another aspect the invention is a method for analyzing a polymer of linked units. 
The method includes the steps of providing a labeled polymer of linked units, detecting 
signals from unit specific markers of less than all of the linked units, and storing a signature 
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of said signals detected to analyze the polymer. In one embodiment all of the unit specific 
markers are detected. In another embodiment the polymer is partially and randomly labeled 
with unit specific markers. In yet another embodiment only a portion of the unit specific 
markers are detected. All of the units of the polymer are labeled with a unit specific marker 
5 in another embodiment. 

The labeled polymer of linked units in one embodiment is exposed to an agent 
selected from the group consisting of electromagnetic radiation, a quenching source and a 
fluorescence excitation source and the signals are produced by the interaction between a unit 
specific marker of the polymer and the agent. 
10 In one embodiment the signals are detected linearly. In another embodiment the 

signature of signals includes at least 10 signals, and preferably 20 signals. The signature of 
signals includes any information about the polymer. Preferably the signature of signals 
includes information about the order, distance and number of unit specific markers. 

In another embodiment the labeled polymer of linked units is moved with respect to a 
1 5 station and wherein the signals are generated upon exposure of a unit specific marker of the 
polymer to the station. The station may be an interaction station. 

The method in some embodiments is a method for identifying a unit specific marker 
of the polymer, the identity of the unit specific marker being indicative of the identity of at 
least one unit of the polymer. The unit specific marker is transiently exposed to a station to 
20 produce signals characteristic of said unit specific marker and the signal is distinguished from 
signals generated from adjacent signal generating unit specific markers of the polymer as an 
indication of the identity of the unit specific marker. The station may be an interaction 
station including an agent selected from the group consisting of electromagnetic radiation, a 
quenching source and a fluorescence excitation source and wherein the signals are detectable 
25 electromagnetic radiation signals. 

The method in other embodiments is a method for determining the proximity of two 
unit specific markers of the polymer wherein the proximity of the two unit specific markers is 
the signature of said signals, the identity of each unit specific marker being indicative of the 
identity of at least one unit of the polymer. The labeled polymer is moved relative to a 
30 station to expose the two unit specific markers to the station to produce a characteristic signal 
arising from a detectable physical change in the unit specific marker or the station, and the 
amount of time elapsed between detecting each characteristic signal is measured, the amount 
of time elapsed being indicative of the proximity of the two unit specific markers. 



-18- 



The method may also be a method for determining the order of two unit specific 
markers of the polymer, the identity of each unit specific marker being indicative of the 
identity of at least one unit of the polymer wherein the order of the two unit specific markers 
is the signature of said signals. The labeled polymer is moved linearly with respect to a 
5 station, to expose one of the unit specific markers to the station to produce a signal which is a 
unit specific marker and to expose the other of the unit specific markers to the station to 
produce a second detectable which is a unit specific marker, different from the first signal. 
The order of the signals determined is an indication of the order of the two unit specific 
markers. 

10 The method in an embodiment is a method for determining the distance between two 

unit specific markers of the polymer, the identity of each unit specific marker being 
indicative of the identity of at least one unit of the polymer wherein the distance between two 
unit specific markers is the signature of said signals. The labeled polymer is moved linearly 
relative to a station to produce a characteristic signal generated as each of the two unit 

15 specific markers passes by the station and the distance between the signals is determined as 
an indication of the distance between the two unit specific markers. 

The method is a method for characterizing a test labeled polymer, wherein a plurality 
of labeled polymers is exposed to a station to obtain the signature of signals for each of the 
plurality of labeled polymers in another embodiment. The method involves the steps of 

20 comparing the signature of signals of the plurality of polymers, determining the relatedness of 
the polymers based upon similarities between the signature of signals of the polymers, and 
characterizing the test polymer based upon the signature of signals of related polymers. 

According to yet another embodiment the method is a method for sequencing a 
polymer of linked units. A signature of signals is obtained from each of a plurality of 

25 overlapping polymers, at least a portion of each of the polymers having a sequence of linked 
units identical to the other of the polymers, and the signature of signals is compared to obtain 
a sequence of linked units which is identical in the plurality of polymers. 

The method in another embodiment is a method for analyzing a set of polymers, each 
polymer of said set being an individual polymer of linked units and wherein the set of 

30 polymers is oriented parallel to one another and a polymer specific feature of said polymers is 
detected. 

Each of the above methods is based on an interaction between a polymer and a station 
involving in some embodiments energy transfer or quenching between a unit and an agent 
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which results in the generation of a signal and in other embodiments a physical change in the 
unit or station which results in the generation of a signal. Each of the methods can be 
performed on many polymers simultaneously or on as few as one polymer at a time. 
Methods for analyzing multiple polymers at one time based on an interaction 
5 involving polymer dependent impulses between the unit and the station also can be 

performed. These methods, which are set forth below, are based on an interaction between a 
unit and a signal generation station which produces any type of polymer dependent impulse 
which can be detected. The polymer dependent impulse is generated by exposure of a unit of 
the polymer to a signal generation station but does not require that a physical change in the 
10 polymer unit or the station occur. For instance, the polymer dependent impulse may result 
from energy transfer, quenching, changes in conductance, mechanical changes, resistance 
changes, or any other physical change. 

A method for characterizing a test polymer is another aspect of the invention. A 
method for characterizing a test polymer is carried out by obtaining polymer dependent 
1 5 impulses for each of a plurality of polymers, comparing the polymer dependent impulses of 
the plurality of polymers, determining the relatedness of the polymers based upon similarities 
between the polymer dependent impulses of the polymers, and characterizing the test 
polymer based upon the polymer dependent impulses of related polymers. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. 
20 In one embodiment the plurality of polymers is a homogenous population. In another 

embodiment the plurality of polymers is a heterogenous population. The polymers can be 
labeled, randomly or non randomly. Different labels can be used to label different linked 
units to produce different polymer dependent impulses. 

The polymer dependent impulses provide many different types of structural 
25 information about the polymer. For instance the obtained polymer dependent impulses may 
include an order of polymer dependent impulses or the obtained polymer dependent impulses 
may include the time of separation between specific signals or the number of specific 
polymer dependent impulses. 

In one important embodiment the polymer dependent impulses are obtained by 
30 moving the plurality of polymers linearly past a signal generation station. 

According to another aspect the invention is a method for determining the distance 
between two individual units of a polymer of linked units. The method involves the steps of 
(1) causing the polymer to pass linearly relative to a station, (2) detecting a polymer 
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dependent impulse generated as each of the two individual units passes by the signal 
generation station, (3) measuring the time elapsed between the polymer dependent impulses 
measured, (4) repeating steps 1, 2 and 3 for a plurality of similar polymers to produce a data 
set, and (5) determining the distance between the two individual units based upon the 

5 information obtained from said plurality of similar polymers by analyzing the data set. In one 
embodiment steps (1) - (4) are carried out substantially simultaneously. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. 
In one embodiment the plurality of polymers is a homogenous population. In another 
embodiment the plurality of polymers is a heterogenous population. The polymers can be 

10 labeled, randomly or non randomly. Different labels can be used to label different linked 
units to produce different polymer dependent impulses. 

In one embodiment the polymer dependent impulse measured is an electromagnetic 
radiation signal generated. In another embodiment the two linked units are detected at the 
signal generation station by measuring light emission at the station. The signal generation 

1 5 station can be a nanochannel. 

According to another aspect the invention is a method for determining the order of 
two individual units of a polymer of linked units. The method involves the steps of (1) 
moving the polymer to linearly with respect to a signal generation station, (2) exposing one of 
the individual units to the station to produce a polymer dependent impulse, (3) exposing the 

20 other of the individual units to the station to produce a second polymer dependent impulse, 
(4) repeating steps 1 , 2 and 3 for a plurality of similar polymers to produce a data set, and (5) 
determining the order of the two individual units based upon the information obtained from 
said plurality of similar polymers by analyzing the data set. In one embodiment steps (1) - 
(4) are carried out substantially simultaneously. In one embodiment the signal measured is 

25 an electromagnetic radiation signal. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. 
In one embodiment the plurality of polymers is a homogenous population. In another 
embodiment the plurality of polymers is a heterogenous population. The polymers can be 
labeled, randomly or non randomly. Different labels can be used to label different linked 

30 units to produce different polymer dependent impulses. 

In one embodiment the polymer dependent impulse measured is an electromagnetic 
radiation signal generated. In another embodiment the two linked units are detected at the 
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signal generation station by measuring light emission at the station. The signal generation 
station can be a nanochannel. 

In another aspect of the invention a method for sequencing a polymer of linked units 
is provided. The method involves the steps of obtaining polymer dependent impulses from a 
5 plurality of overlapping polymers, at least a portion of the polymers having a sequence of 
linked units identical to the other of the polymers, and comparing the polymer dependent 
impulses to obtain a sequence of linked units which is identical in the plurality of polymers. 

In one embodiment the polymer dependent impulses are optically detectable. In 
another embodiment the nucleic acids are labeled with an agent selected from the group 
10 consisting of an electromagnetic radiation source, a quenching source, a fluorescence 
excitation source, and a radiation source. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. 
In one embodiment the plurality of polymers is a homogenous population. In another 
embodiment the plurality of polymers is a heterogenous population. The polymers can be 
: 15 labeled, randomly or non randomly. Different labels can be used to label different linked 
: units to produce different polymer dependent impulses. 

A method for labeling nucleic acids is also provided. The method involves the step of 
contacting a dividing cell with a nucleotide analog, isolating from the cell nucleic acids that 
; have incorporated the nucleotide analog, and modifying the nucleic acid with incorporated 

- 20 nucleotide analog by labeling the incorporated nucleotide analog. In one embodiment the 
nucleotide analog is a brominated analog. 

The dividing cell may optionally be contacted with a nucleotide analog by growth 
arresting the cell in the cell division cycle, performing the contacting step, and allowing the 
cell to reenter the cell division cycle. The nucleic acids may then be isolated after the cells 
25 have reentered and completed the cell division cycle and before a second cell division cycle 
is completed. 

In another embodiment the incorporated nucleotide analog is labeled with an agent 
selected from the group consisting of an electromagnetic radiation source, a quenching source 
and a fluorescence excitation source. 
30 According to another aspect of the invention a method is provided for analyzing a set 

of polymers, each polymer of said set being an individual polymer of linked units. The 
method involves the step of orienting the set of polymers parallel to one another, and 
detecting a polymer specific feature of said polymers. In one embodiment the orientation 
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step is in a solution free of gel. The polymers may be oriented using any method. A 
preferred method for orienting the polymers is to apply an electric field to the polymers. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. 
In one embodiment the plurality of polymers is a homogenous population. In another 
5 embodiment the plurality of polymers is a heterogenous population. The polymers can be 
labeled, randomly or non randomly. Different labels can be used to label different linked 
units to produce different polymer dependent impulses. 

The polymer specific feature is information about a structural feature of a polymer. 
The polymer specific feature can be an order of linked unity in the polymers. 
10 In one embodiment the detecting step is performed simultaneously for said polymers. 

In another embodiment the detection step comprises measuring electromagnetic radiation 
signals. According to a preferred embodiment the detection step comprises causing the 
polymers to pass linearly relative to a plurality of signal generation stations, and detecting 
and distinguishing polymer dependent impulses generated as said polymers pass said signal 
1 5 generation stations . 

A method for analyzing a set of polymers, each polymer of the set being an individual 
polymer of linked units is provided in another aspect of the invention. The method involves 
the steps of orienting the set of polymers in an electric field, simultaneously moving the set of 
polymers through defined respective channels, and detecting a polymer specific feature as the 
20 polymers are moved through the channels. In one embodiment the orientation step is in a 
solution free of gel. Preferably the channels are nanochannels. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. 
In one embodiment the plurality of polymers is a homogenous population. In another 
embodiment the plurality of polymers is a heterogenous population. The polymers can be 
25 labeled, randomly or non randomly. Different labels can be used to label different linked 
units to produce different polymer dependent impulses. 

The polymer specific feature is information about a structural feature of a polymer. 
The polymer specific feature can be an order of linked unity in the polymers. 

In one embodiment the detecting step is performed simultaneously for said polymers. 
30 In another embodiment the detection step comprises measuring electromagnetic radiation 
signals. According to a preferred embodiment the detection step comprises causing the 
polymers to pass linearly relative to a plurality of signal generation stations, and detecting 
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and distinguishing polymer dependent impulses generated as said polymers pass said signal 
generation stations. 

According to yet another aspect of the invention an article of manufacture is provided. 
The article of manufacture includes a wall material having a surface defining a channel, an 

5 agent wherein the agent is selected from the group consisting of an electromagnetic radiation 
source, a quenching source, a luminescent film layer and a fluorescence excitation source, 
attached to the wall material adjacent to the channel, wherein the agent is close enough to the 
channel and is present in an amount sufficient to detectably interact with a partner compound 
selected from the group consisting of a light emissive compound, a light accepting 

10 compound, radiative compound, and a quencher passing through the channel. Preferably the 
channel is a support for a polymer. 

The agent in one embodiment is an electromagnetic radiation source and the 
electromagnetic radiation source is a light emissive compound. In another embodiment the 
channel is selected from the group consisting of is a microchannel and a nanochannel. 

15 According to another embodiment the surface of the wall material defining the 

channel is free of the light emissive compound. In another embodiment the light emissive 
compound is attached to an external surface of the wall material. In yet another embodiment 
the light emissive compound is attached to a linker which is attached to the external surface 
of the wall material. In still another embodiment the light emissive compound is embedded 

20 in the wall material or in a layer of or upon the wall material. The light emissive compound 
can be concentrated at a region of the external surface of the wall material that surrounds a 
portion of the channel in another embodiment. The light emissive compound may form a 
concentric ring in the wall material around a portion of the channel. A masking layer having 
openings which allow exposure of only localized areas of the light emissive compound may 

25 also be part of the article of manufacture. 

A second light emissive compound different from the first may be attached to the wall 
material adjacent to the channel, wherein the light emissive compound is close enough to the 
channel and is present in an amount effective to detectably interact with a partner light 
emissive compound passing through the channel. 

30 The wall material may be made up of different layers. In one embodiment the 

external surface of the wall material adjacent to the light emissive compound is a conducting 
layer. In another embodiment the wall material comprises two layers, the conducting layer 
and a nonconducting layer. The wall material may also be composed of at least two layers, a 
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first layer preventing signal generation and a second layer allowing signal generation. 

Alternatively the wall material adjacent to the light emissive compound is a light 

impermeable layer. In another embodiment the wall material comprises two layers, the light 

impermeable layer and a support light permeable layer. The wall material can be a second 
5 light impermeable layer on a second side of the light emissive compound, the first and second 

layers sandwiching the light emissive compound. In a preferred embodiment the light 

emissive compound is a fluorescent compound. 

The channel can have any shape or dimensions. Preferably the channel is a 

nanochannel which is between 1 Angstrom and 1 mm. In a preferred embodiment the width 
10 of the channel is between 1 and 500 Angstroms. Preferably the wall includes multiple 

channels. Preferably the wall material includes at least 2 and more preferably at least 50 

channels. 

In one embodiment the wall material is formed of two layers, a first light 
impermeable layer and a luminescent film layer attached to one another, wherein the channel 

15 extends through both layers and is defined by surfaces of both layers. Preferably the channel 
is a nanochannel. In a preferred embodiment the length of the channel is between 1 
Angstrom and 1 mm. The article in some embodiments includes a second light impermeable 
layer, the luminescent film layer positioned between the first and second light impermeable 
layers. In a preferred embodiment the surface defining the channel includes a surface of the 

20 light impermeable layer which is free of luminescent film layer material. 

In another embodiment the agent is a fluorescence excitation source and wherein the 
fluorescence excitation source is a scintillation layer. The scintillation layer may be selected 
from the group consisting of Nal(TI), ZnS(Ag), anthracene, stilbene, and plastic phosphors. 
Preferably the scintillation layer is embedded in the wall material between two radiation 

25 impermeable layers, such as lead or Lucite. 

In another aspect the invention is an article of manufacture which is a wall material 
having a surface defining a plurality of channels, and a station attached to a discrete region of 
the wall material adjacent to at least one of the channels, wherein the station is close enough 
to the channel and is present in an amount sufficient to cause a signal to arise from a 

30 detectable physical change in a polymer of linked units passing through the channel or in the 
station as the polymer is exposed to the station. 

According to another aspect of the invention an article of manufacture is provided. 
The article is a wall material having a surface defining a channel, and a plurality of stations 
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each attached to a discrete region of the wall material adjacent to the channel, wherein the 
stations are close enough to the channel and are present in an amount sufficient to cause a 
signal to arise from a detectable physical change in a polymer of linked units passing through 
the channel or in the station as the polymer is exposed to the station. 

5 A method for preparing a wall material is another aspect of the invention. The 

method involves the steps of covalently bonding light emissive compounds or quenching 
compounds to a plurality of discrete locations of a wall material, each of said discrete 
locations close enough to a respective interaction station on said wall material, whereby when 
an individual unit of a polymer, which is interactive with said light emissive compound or 

10 quenching compound to produce a signal, is positioned at said interaction station, the light 
emissive compound or the quenching compound interacts with the individual unit to produce 
the signal. In one embodiment the method includes the step of applying a layer of conductive 
material to said wall material. 

In another embodiment the light emissive compounds or quenching compounds are 

15 covalently bonded at discrete locations close to channels in said wall material, said channels 
defining interaction stations. The channels preferably are microchannels. In a more 
preferred embodiment the channels are nanochannels. The light emissive compounds or 
quenching compounds can be covalently bonded to the wall material in a manner whereby the 
surfaces of the wall material defining the channel are free of the light emissive compounds 

20 and quenching compounds. 

The invention also encompasses a method for attaching a chemical substance 
selectively at a rim of a channel through a wall material that is opaque. The method involves 
the steps of providing a wall material with photoprotective chemical groups attached at the 
rim of the channel through the wall material, applying light to the photoprotective chemical 

25 groups to dephotoprotect the chemical groups, and attaching the chemical substance to the 
deprotected chemical groups. 

In one embodiment the light is applied to only selected regions of a surface of the wall 
material defining the rim of the channel. In another embodiment the channel has a first end 
and a second end, the rim being at the first end, and wherein the light is applied to the second 

30 end, the light passing through the channel to contact the photoprotected chemical groups at 
the rim of said first end. The channels preferably are microchannels. In a more preferred 
embodiment the channels are nanochannels. 
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According to another aspect of the invention a method is provided for preparing a 
wall material having localized areas of light emission on a surface of the wall material. The 
method involves the steps of providing a wall material having a surface and applying a light 
emissive compound to the surface to produce at least localized areas of light emission on the 
5 surface, wherein the localized areas define a target region for detecting light emission, and 
wherein the target region is a rim of a channel through the wall material. In one embodiment 
the method further includes the steps of attaching a photoprotective chemical group to the 
surface of the wall material, applying light to the photoprotective chemical groups to 
dephotoprotect the chemical groups prior to attaching the light emissive compound, and 
10 attaching the light emissive compound to the dephotoprotected chemical groups. 

In one embodiment the light is applied to only selected regions of a surface of the wall 
material defining the rim of the channel. In a preferred embodiment the photoprotective 
chemical group is attached to only selected regions of the surface of the wall material 
defining the rim of the channel. In another embodiment the channel has a first end and a 
15 second end, the rim being at the first end, and wherein the light is applied to the second end, 
the light passing through the channel to contact the photoprotected chemical groups at the rim 
of said first end. The channels preferably are micro channels. In a more preferred 
embodiment the channels are nanochannels. 

The method can include the further step of positioning a mask having openings over 
20 the surface of the wall material such that only localized areas of light emission are exposed 
through the openings of the mask. In one embodiment the light emissive compound is 
attached to a portion of the surface of the wall material. 

According to another aspect of the invention an apparatus for detecting a signal is 
provided. The apparatus is a housing with a buffer chamber, a wall defining a portion of the 
25 buffer chamber, and having a plurality of openings for aligning polymers, a sensor fixed 

relative to the housing, the sensor distinguishing the signals emitted at each opening from the 
signals emitted at the other of the openings to generate opening dependent sensor signals, and 
a memory for collecting and storing said sensor signals. In a preferred embodiment the 
sensor is an optical sensor. 
30 In one embodiment the optical sensor senses electromagnetic radiation signals emitted 

at the plurality of openings. In another embodiment the apparatus includes a microprocessor. 

In one embodiment the openings are defined by channels in the wall. Preferably the 
openings are defined by microchannels in the wall. More preferably the openings are defined 
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by nanochannels in the wall. In one embodiment the plurality of openings is at least two. In 
a preferred embodiment the plurality is at least 50. 

In one embodiment the apparatus includes a second buffer chamber separated from 
said first buffer chamber, by said wall, and wherein the buffer chambers are in fluid 
5 communications with one another via the openings. In another embodiment the apparatus 
includes a pair of electrodes secured to the housing, one of said pair positioned in the first 
buffer chamber and the other of the pair positioned in the second buffer chamber. 

According to another aspect of the invention an apparatus for detecting a signal is 
provided. The apparatus includes a housing defining a first buffer chamber and a second 
10 buffer chamber, a wall supported by the housing and separating the first and second buffer 
chambers, a plurality of channels defined by the wall and providing fluid communications 
between the first and second buffer chambers, and a sensor for distinguishing and collecting 
channel dependent signals. Preferably the channel is a microchannel. More preferably the 
channel is a nanochannel. In one embodiment the plurality of channels is at least two. In a 
15 preferred embodiment the plurality is at least 50. Preferably the signal is an optical signal. 

In one embodiment the wall surrounding the channel includes an agent is selected 
from the group consisting of electromagnetic radiation, a quenching source and a 
fluorescence excitation source is attached to the wall. Preferably the agent is electromagnetic 
radiation and the electromagnetic radiation is a light emissive compound. In one embodiment 
20 the light emissive compound is concentrated at the channels in the wall. 

According to another embodiment the apparatus includes a means for moving 
biological entities through the channels. 

According to another aspect of the invention an apparatus including a housing with a 
buffer chamber, a wall material defining a portion of the buffer chamber, the wall including 
25 polymer interaction stations, and an optical sensor secured to the housing, the optical sensor 
constructed and arranged to detect electromagnetic radiation signals emitted at the interaction 
stations is provided. 

In another aspect the invention is a computer system for making characteristic 
information of a plurality of polymers available in response to a request. The system has a 
30 memory for storing, for the plurality of the polymers and in a manner accessible using a 
unique identifier for the polymer, records including information indicative of sequentially 
detected signals arising from a detectable physical change in the plurality of individual units 
of the polymer or a station to which the polymer is exposed and a processor for accessing the 
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records stored in the memory for a selected one of the plurality of the polymers according to 
a unique identifier associated with the selected polymer. 

In one embodiment the signal results from an interaction of a plurality of individual 
units of the polymer exposed to an agent selected from the group consisting of 
5 electromagnetic radiation, a quenching source and a fluorescence excitation source. In 
another embodiment the computer system also includes a means for comparing the 
sequentially detected signals of the selected polymer to a known pattern of signals 
characteristic of a known polymer to determine relatedness of the selected polymer to the 
known polymer. 

10 Each of the limitations of the invention can encompass various embodiments of the 

invention. It is, therefore, anticipated that each of the limitations of the invention involving 
any one element or combinations of elements can be included in each apparatus and each 
method. 

Brief Description of the Drawings 

15 Figure 1 shows a schematic of a random labeling method. 

Figure 2 is a graph of raw data demonstrating changes in energy emission patterns to 
determine distance information through the instantaneous rate method. The changes in 
energy patterns result from sequential detectable signals which when plotted produce a curve 
that from left to right shows two energy intensity decreases, followed by two energy intensity 

20 increases. The rate is 6.8 A/s and ti is the time between the entry of the first and second 
labels. 

Figure 3 shows a representative population of random A-labeled fragments for a 16- 
mer with the sequence 5' ACGTACGTACGTACGT 3' (SEQ. ID. No. 1) and also depicts 
how distance information is used to determine base separation between acceptor labeled 
25 nucleotides. 

Figure 4 is a schematic depicting various potential constructions of a nanochannel 

plate. 

Figure 5 is a schematic depicting a nanochannel plate for analyzing a radioactive 
polymer. 

30 Figure 6 is a schematic of a cross section of a nanochannel with a concentric ring of 

donor fiuorophores and a graph showing donor emission and energy transfer on the 
corresponding nanochannel. 
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Figure 7 shows a model of a nanochannel plate having multiple nanochannels with a 
layer of donor ffuorophores within the plate. Figure 7B shows an enlarged view of a single 
nanochannel with a single acceptor positioned adjacent to a conducting layer. 

Figure 8A and B are schematics demonstrating signal generation upon passage of an 
5 acceptor label through the nanochannel. Figure 8A shows an enlarged view of one 

nanochannel. Figure 8B graphically illustrates the sharp changes in donor intensity as an 
acceptor label moves from positions A to C. 

Figure 9 schematically and graphically demonstrates the passage of a two-base 
labeled strand of DNA through a nanochannel with the proper thickness and radii of donor 
10 fluorophores sandwiched between conducting material. 

Figure 10 is a volume graph visually demonstrating the change in donor emission in 
the presence of one and two acceptors. Figure 10A illustrates the amount of change as 
volumes. Figure 10B shows the change for one acceptor. 

Figure 1 1 shows the results of an experiment demonstrating that DNA can pass 
1 5 through fabricated nanochannels. 

Figure 12 shows an apparatus constructed to hold a nanochannel (or microchannel) 
plate which is capable of generating an electric field. 

Figure 13 depicts a nanochannel apparatus which consists of two fused Pyrex cells 
that hold the nanochannel plate, an upper and lower buffer region, electrodes, an immersion 
20 objective and a voltage supply. 

Figure 14 is a schematic diagram of in vitro base specific (IBS A) labeling 
Figure 15 shows the general scheme for deciphering DNA sequence from IBS A 
labeling. 

25 Detailed Description of the Invention 

The invention encompasses methods of analyzing or identifying a polymer or a unit of 
a polymer, by detecting a signal or polymer dependent impulse that results from an 
interaction between at least one unit of the polymer and a station or an agent or by a change 
in the unit or a station when the unit is exposed to the station. By "analyzing" a polymer, it is 
30 meant obtaining some information about the structure of the polymer such as its size, the 
order of its units, its relatedness to other polymers, the identity of its units, or its presence. 
Since the structure and function of biological molecules are interdependent, the structural 
information can reveal important information about the function of the polymer. 
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One method according to the invention is a method for analyzing a polymer of linked 
units by exposing a plurality of units of the polymer to an agent selected from the group 
consisting of electromagnetic radiation, a quenching source and a fluorescence excitation 
source such that each of the individual units interacts with the agent to produce a detectable 
5 signal. The signal resulting from the interaction is detected sequentially. 

As used herein a unit of a polymer is "exposed" to an agent or a station by positioning 
or presenting the unit and the agent or station in interactive proximity to one another such that 
energy transfer or quenching or a physical change in the unit or agent or station can occur 
between them to produce a detectable signal. By interactive proximity it is meant close 

10 enough to permit the interaction or change which yields the detectable signal. 

In one embodiment the units of the polymer are exposed sequentially to the agent. By 
"sequentially exposed" it is meant in general separated from one another in time. In a 
preferred embodiment, the polymer and the agent are caused to move relative to one another 
in a "linear" manner such that each unit of the polymer passes within interactive proximity to 

15 the agent at an interaction station. When each unit of the polymer interacts with the agent or 
station, a detectable signal is produced. This detectable signal can be captured (sequentially 
detected) and recorded by a detection device. The detectable signal produced for each unit 
can be indicative of the type of unit. As used herein signals are detected "sequentially" when 
signals from different units of a single polymer are detected spaced apart in time. Not all 

20 units need to be detected or need to generate a signal to detect signals "sequentially." 

When the units are sequentially exposed to the agent or station the unit and the agent 
or station move relative to one another. As used herein the phrase "the unit and the agent or 
station move relative to one another" means that either the unit and the agent or station are 
both moving or only one of the two is moving and the other remains stationary at least during 

25 the period of time of the interaction between the unit and the agent or station. The unit and 
the agent or station may be moved relative to one another by any mechanism. For instance 
the agent or station may remain stationary and the polymer may be drawn past the agent or 
station by an electric current. Other methods for moving the polymer include but are not 
limited to movement resulting from a magnetic field, a mechanical force, a flowing liquid 

30 medium, a pressure system, a gravitational force, and a molecular motor such as e.g., a UNA 
polymerase or a helicase when the polymer is DNA or e.g., myosin when the polymer is a 
peptide such as actin. The movement of the polymer may be assisted by the use of a channel, 
groove or ring to guide the polymer. Alternatively the agent or station may be moved and the 



-31- 

polymer may remain stationary. For instance the agent or station may be held within a 
scanning tip that is guided along the length of the polymer. 

In another embodiment signals are detected simultaneously. As used herein signals 
are "detected simultaneously" by causing a plurality of the labeled units of a polymer to be 

5 exposed to an agent or station at once. The plurality of the units can be exposed to an agent 
or station at one time by using multiple interaction sites. Signals can be detected at each of 
these sites simultaneously. For instance multiple agents or stations may be localized at 
specific locations in space which correspond to the units of the polymer. When the polymer 
is brought within interactive proximity of the multiple agents or stations signals will be 

10 generated simultaneously. This may be embodied, for example, in a linear array of agents or 
stations positioned at substantially equivalent distances which are equal to the distance 
between the units. The polymer may be positioned with respect to the agent or station such 
that each unit is in interactive proximity to an agent or station to produce simultaneous 
signals. 

15 When the signals are detected sequentially multiple polymers also can be analyzed 

simultaneously. Multiple polymers are analyzed simultaneously by causing more than one 
polymer to move relative to respective agent or stations at one time. The polymers may be 
similar or distinct. If the polymers are similar, the same or different units may be detected 
simultaneously. It is preferred that at least two polymers are analyzed simultaneously. In a 

20 more preferred embodiment at least 50 polymers are analyzed simultaneously and in another 
preferred embodiment at least 100 polymers are analyzed simultaneously. 

A "polymer" as used herein is a compound having a linear backbone of individual 
units which are linked together by linkages. In some cases the backbone of the polymer may 
be branched. Preferably the backbone is unbranched. The term "backbone" is given its usual 

25 meaning in the field of polymer chemistry. The polymers may be heterogeneous in backbone 
composition thereby containing any possible combination of polymer units linked together 
such as peptide- nucleic acids (which have amino acids linked to nucleic acids and have 
enhanced stability). In a preferred embodiment the polymers are homogeneous in backbone 
composition and are, for example, nucleic acids, polypeptides, polysaccharides, 

30 carbohydrates, polyurethanes, polycarbonates, polyureas, polyethyleneimines, polyarylene 
sulfides, polysiloxanes, polyimides, polyacetates, polyamides, polyesters, or polythioesters. 
In the most preferred embodiments, the polymer is a nucleic acid or a polypeptide. A 
"nucleic acid" as used herein is a biopolymer comprised of nucleotides, such as deoxyribose 
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nucleic acid (DNA) or ribose nucleic acid (RNA). A polypeptide as used herein is a 
biopolymer comprised of linked amino acids. 

As used herein with respect to linked units of a polymer, "linked" or "linkage" means 
two entities are bound to one another by any physicochemical means. Any linkage known to 
5 those of ordinary skill in the art, covalent or non-covalent, is embraced. Such linkages are 
well known to those of ordinary skill in the art. Natural linkages, which are those ordinarily 
found in nature connecting the individual units of a particular polymer, are most common. 
Natural linkages include, for instance, amide, ester and thioester linkages. The individual 
units of a polymer analyzed by the methods of the invention may be linked, however, by 
10 synthetic or modified linkages. Polymers where the units are linked by covalent bonds will 
be most common but also include hydrogen bonded, etc.. 

The polymer is made up of a plurality of individual units. An "individual unit" as 
used herein is a building block or monomer which can be linked directly or indirectly to other 
building blocks or monomers to form a polymer. The polymer preferably is a polymer of at 
15 least two different linked units. The at least two different linked units may produce or be 

labeled to produce different signals, as discussed in greater detail below. The particular type 
of unit will depend on the type of polymer. For instance DNA is a biopolymer comprised of 
a deoxyribose phosphate backbone composed of units of purines and pyrimidines such as 
adenine, cytosine, guanine, thymine, 5-methylcytosine, 2-aminopurine, 
20 2-amino-6-chloropurine, 2,6-diaminopurine, hypoxanthine, and other naturally and 

non-naturally occurring nucleobases, substituted and unsubstituted aromatic moieties. RNA 
is a biopolymer comprised of a ribose phosphate backbone composed of units of purines and 
pyrimidines such as those described for DNA but wherein uracil is substituted for thymidine. 
DNA units may be linked to the other units of the polymer by their 5' or 3' hydroxyl group 
25 thereby forming an ester linkage. RNA units may be linked to the other units of the polymer 
by their 5', 3' or 2' hydroxyl group thereby forming an ester linkage. Alternatively, DNA or 
RNA units having a terminal 5', 3' or 2' amino group may be linked to the other units of the 
polymer by the amino group thereby forming an amide linkage. The individual units of a 
polypeptide are amino acids, including the 20 naturally occurring amino acids as well as 
30 modified amino acids. Amino acids may exist as amides or free acids and are linked to the 
other units in the backbone of the polymers through their a-amino group thereby forming an 
amide linkage to the polymer. 

A "plurality of individual units" is at least two units linked to one another. 
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Whenever a nucleic acid is represented by a sequence of letters it will be understood 
that the nucleotides are in 5' -> 3' order from left to right and that "A" denotes adenosine, "C" 
denotes cytidine, "G" denotes guanosine, "T" denotes thymidine, and "U" denotes uracil 
unless otherwise noted. 

5 The polymers may be native or naturally-occurring polymers which occur in nature or 

non-naturally occurring polymers which do not exist in nature. The polymers typically 
include at least a portion of a naturally occurring polymer. The polymers can be isolated or 
synthesized de novo. For example, the polymers can be isolated from natural sources e.g. 
purified, as by cleavage and gel separation or may be synthesized e.g.,(i) amplified in vitro 

10 by, for example, polymerase chain reaction (PCR); (ii) synthesized by, for example, chemical 
synthesis; (iii) recombinantly produced by cloning, etc. 

The polymer or at least one unit thereof is in a form which is capable of interacting 
with an agent or station to produce a signal characteristic of that interaction. The unit of a 
polymer which is capable of undergoing such an interaction is said to be labeled. If a unit of 

15 a polymer can undergo that interaction to produce a characteristic signal, then the polymer is 
said to be intrinsically labeled. It is not necessary that an extrinsic label be added to the 
polymer. If a non-native molecule, however, must be attached to the individual unit of the 
polymer to generate the interaction producing the characteristic signal, then the polymer is 
said to be extrinsically labeled. The "label" may be, for example, light emitting, energy 

20 accepting, fluorescent, radioactive, or quenching. 

Many naturally occurring units of a polymer are light emitting compounds or 
quenchers. For instance, nucleotides of native nucleic acid molecules have distinct 
absorption spectra, e.g., A, G, T, C, and U have absorption maximums at 259 nm, 252 nm, 
267 nm, 271 nm, and 258 nm respectively. Modified units which include intrinsic labels may 

25 also be incorporated into polymers. A nucleic acid molecule may include, for example, any 
of the following modified nucleotide units which have the characteristic energy emission 
patterns of a light emitting compound or a quenching compound: 2,4-dithiouracil, 2,4- 
Diselenouracil, hypoxanthine, mercaptopurine, 2-aminopurine, and selenopurine. 

A unit may also be considered to be intrinsically labeled when a property of the unit 

30 other than a light emitting, quenching or radioactive property provides information about the 
identity of the unit without the addition of an extrinsic label. For instance the shape and 
charge of the unit provides information about the unit which can result in a specific 
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characteristic signal, such as a change in conductance arising from the blockage of a 
conductance path by the unit. 

If an extrinsic label is selected for use according to the methods of the invention, the 
type of extrinsic label selected will depend on a variety of factors, including the nature of the 
5 analysis being conducted, the type of the agent and the type of polymer. Extrinsic label 
compounds include but are not limited to light emitting compounds, quenching compounds, 
radioactive compounds, spin labels, and heavy metal compounds. The label should be 
stearically compatible and chemically compatible with the units of the polymer being 
analyzed. 

10 A "light emissive compound" as used herein is a compound that emits light in 

response to irradiation with light of a particular wavelength. These compounds are capable 
of absorbing and emitting light through phosphorescence, chemiluminescence, luminescence, 
polarized fluorescence, scintillators or, more preferably, fluorescence. The particular light 
emissive compound selected will depend on a variety of factors which are discussed in 

15 greater detail below. Light emissive compounds have been described extensively in the 
literature. For example, Haugland, R. P., Handbook of Fluorescent Probes and Research 
Chemicals, 6th edition, Molecular Probes, Inc., 1996, which is hereby incorporated by 
reference provides a description of light emitting compounds. 

Radioactive compounds are substances which emit alpha, beta, or gamma nuclear 

20 radiation. Alpha rays are positively charged particles of mass number 4 and slightly 

deflected by electrical and magnetic fields. Beta rays are negatively charged electrons and 
are strongly deflected by electrical and magnetic fields. Gamma rays are photons of 
electromagnetic radiation and are undeflected by electrical and magnetic fields and are of 
wavelength of the order to 10" 8 to 10" 9 cm. 

25 Generally, fluorescent dyes are hydrocarbon molecules having a chain of several 

conjugated double bonds. The absorption and emission wavelengths of a dye are 
approximately proportional to the number of carbon atoms in the conjugated chain. A 
preferred fluorescent compound is "Cy-3" (Biological Detection Systems, Pittsburgh, PA). 
Other preferred fluorescent compounds useful according to the invention include but are not 

30 limited to fluorescein isothiocyanate ("FITC"), Texas red, tetramethylrhodamine 

isothiocyanate ("TRITC"), 4, 4-difluoro-4-bora-3a, and 4a-diaza-s-indacene ("BODIPY"). 

Chemiluminescent compounds are compounds which luminesce due to a chemical 
reaction. Phosphorescent compounds are compounds which exhibit delayed luminescence as 
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a result of the absorption of radiation. Luminescence is a non-thermal emission of 
electromagnetic radiation by a material upon excitation. These compounds are well known in 
the art and are available from a variety of sources. 

In one embodiment of the invention the light emissive compound is a donor or an 
5 acceptor fiuorophore. A fluorophore as used herein is a molecule capable of absorbing light 
at one wavelength and emitting light at another wavelength. A donor fluorophore is a 
fluorophore which is capable of transferring its fluorescent energy to an acceptor molecule in 
close proximity. An acceptor fluorophore is a fiuorophore that can accept energy from a 
donor at close proximity. (An acceptor of a donor fluorophore does not have to be a 
10 fluorophore. It may be non-fluorescent.) Fluorophores can be photochemically promoted to 
an excited state, or higher energy level, by irradiating them with light. Excitation 
wavelengths are generally in the UV, blue, or green regions of the spectrum. The 
fluorophores remain in the excited state for a very short period of time before releasing their 
energy and returning to the ground state. Those fluorophores that dissipate their energy as 
15 emitted light are donor fluorophores. The wavelength distribution of the outgoing photons 
forms the emission spectrum, which peaks at longer wavelengths (lower energies) than the 
excitation spectrum, but is equally characteristic for a particular fluorophore. 

In another embodiment of the invention the unit is labeled with a radioactive 
compound. The radioactive compound emits nuclear radiation as it passes the agent or 
20 station. When the agent is a scintillation layer the nuclear radiation interacts with the 

scintillation layer and causes fluorescent excitation. A fluorescent signal indicative of the 
radioactively labeled unit can then be detected. 

Extrinsic labels can be added to the polymer by any means known in the art. For 
example, the labels may be attached directly to the polymer or attached to a linker which is 
25 attached to the polymer. For instance, fluorophores have been directly incorporated into 
nucleic acids by chemical means but have also been introduced into nucleic acids through 
active amino or thio groups into a nucleic acid. Proudnikov and Mirabekov, Nucleic Acids 
Research, 24: 4535-4532 (1996). Modified units which can easily be chemically derivitized 
or which include linkers can be incorporated into the polymer to enhance this process. An 
30 extensive description of modification procedures which can be performed on the polymer, the 
linker and/or the extrinsic label in order to prepare a bioconjugate can be found in 
Hermanson, G. T., Bioconjugate Techniques, Academic Press, Inc., San Diego, 1996, which 
is hereby incorporated by reference. 
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There are several known methods of direct chemical labeling of DNA (Hermanson, 
1996; Roget et al., 1989; Proudnikov and Mirzzbekov, 1996). One of the methods is based 
on the introduction of aldehyde groups by partial depurination of DNA. Fluorescent labels 
with an attached hydrazine group are efficiently coupled with the aldehyde groups and the 
5 hydrazone bonds are stabilized by reduction with sodium labeling efficiencies around 60%. 
The reaction of cytosine with bisulfite in the presence of an excess of an amine fluorophore 
leads to transamination at the N 4 position (Hermanson, 1996). Reaction conditions such as 
pH, amine fluorophore concentration, and incubation time and temperature affect the yield of 
products formed. At high concentrations of the amine fluorophore (3M), transamination can 
10 approach 1 00% yield (Draper and Gold, 1 980). 

Light emissive compounds can be attached to polymers or other materials by any 
mechanism known in the art. For instance, functional groups which are reactive with various 
light emissive groups include, but are not limited to, (functional group: reactive group of light 
emissive compound) activated estenamines or anilines; acyl azide:amines or anilines; acyl 
15 halide-.amines, anilines, alcohols or phenols; acyl nitrile:alcohols or phenols; aldehyde: amines 
or anilines; alkyl halide:amines, anilines, alcohols, phenols or thiols; alkyl sulfonate:thiols, 
alcohols or phenols; anhydride: alcohols, phenols, amines or anilines; aryl halide:thiols; 
aziridine:thiols or thioethers; carboxylic acid:amines, anilines, alcohols or alkyl halides; 
diazoalkanexarboxylic acids; epoxide :thiols; haloacetamide:thiols; halotriazine:amines, 
20 anilines or phenols; hydrazine: aldehydes or ketones; hydroxyamine:aldehydes or ketones; 
imido estenamines or anilines; isocyanate: amines or anilines; and isothiocyanate: amines or 
anilines. 

The agent that interacts with the unit of the polymer is selected from the group 
consisting of electromagnetic radiation, a quenching source, and a fluorescence excitation 
25 source. "Electromagnetic radiation" as used herein is energy produced by electromagnetic 
waves. Electromagnetic radiation may be in the form of a direct light source or it may be 
emitted by a light emissive compound such as a donor fluorophore. "Light" as used herein 
includes electromagnetic energy of any wavelength including visible, infrared and ultraviolet. 
As used herein, a quenching source is any entity which alters or is capable of altering 
30 a property of a light emitting source. The property which is altered can include intensity 
fluorescence lifetime, spectra, fluorescence, or phosphorescence. 

A fluorescence excitation source as used herein is any entity capable of fluorescing or 
giving rise to photonic emissions (i.e. electromagnetic radiation, directed electric field, 
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temperature, fluorescence, radiation, scintillation, physical contact, or mechanical disruption.) 
For instance, when the unit is labeled with a radioactive compound the radioactive emission 
causes molecular excitation of an agent that is a scintillation layer which results in 
fluorescence. 

When a unit of the polymer is exposed to the agent the interaction between the two 
produces a signal. The signal provides information about the polymer. For instance if all 
units of a particular type, e.g., all of the alanines, of a protein polymer are labeled (intrinsic or 
extrinsic) with a particular light emissive compound then when a signal characteristic of that 
light emissive compound is detected upon interaction with the agent the signal signifies that 
an alanine residue is present at that particular location on the polymer. If each type of unit 
e.g., each type of amino acid is labeled with a different light emissive compound having a 
distinct light emissive pattern then each amino acid will interact with the agent to produce a 
distinct signal. By determining what each signal for each unit of the polymer is, the 
sequence of units can be determined. 

The interaction between the unit and the agent can take a variety of forms, but does 
not require that the unit and the agent physically contact one another. Examples of 
interactions are as follows. A first type of interaction involves the agent being 
electromagnetic radiation and the unit of the polymer being a light emissive compound 
(either intrinsically or extrinsically labeled with a light emissive compound). When the light 
emissive unit is contacted with electromagnetic radiation (such as by a laser beam of a 
suitable wavelength or electromagnetic radiation emitted from a donor fiuorophore), the 
electromagnetic radiation causes the light emissive compound to emit electromagnetic 
radiation of a specific wavelength. The signal is then measured. The signal exhibits a 
characteristic pattern of light emission and thus indicates that a particular labeled unit of the 
polymer is present. In this case the unit of the polymer is said to "detectably affect the 
emission of the electromagnetic radiation from the light emissive compound." 

A second type of interaction involves the agent being a fluorescence excitation source 
and the unit of the polymer being a light emissive or a radioactive compound. When the light 
emissive unit is contacted with the fluorescence excitation source, the fluorescence excitation 
source causes the light emissive compound to emit electromagnetic radiation of a specific 
wavelength. When the radioactive unit is contacted with the fluorescence excitation source, 
the nuclear radiation emitted from the unit causes the fluorescence excitation source to emit 
electromagnetic radiation of a specific wavelength. The signal then is measured. 
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A variation of these types of interaction involves the presence of a third element of 
the interaction, a proximate compound which is involved in generating the signal. For 
example, a unit may be labeled with a light emissive compound which is a donor fluorophore 
and a proximate compound can be an acceptor fluorophore. If the light emissive compound 
is placed in an excited state and brought proximate to the acceptor fluorophore, then energy 
transfer will occur between the donor and acceptor, generating a signal which can be detected 
as a measure of the presence of the unit which is light emissive. The light emissive 
compound can be placed in the "excited" state by exposing it to light (such as a laser beam) 
or by exposing it to a fluorescence excitation source. 

Another interaction involves a proximate compound which is a quenching source. In 
this instance, the light emissive unit is caused to emit electromagnetic radiation by exposing 
it to light. If the light emissive compound is placed in proximity to a quenching source, then 
the signal from the light emissive unit will be altered. 

A set of interactions parallel to those described above can be created wherein, 
however, the light emissive compound is the proximate compound and the unit is either a 
quenching source or an acceptor source. In these instances the agent is electromagnetic 
radiation emitted by the proximate compound, and the signal is generated, characteristic of 
the interaction between the unit and such radiation, by bringing the unit in interactive 
proximity with the proximate compound. 

The mechanisms by which each of these interactions produces a detectable signal is 
known in the art. For exemplary purposes the mechanism by which a donor and acceptor 
fluorophore interact according to the invention to produce a detectable signal including 
practical limitations which are known to result from this type of interaction and methods of 
reducing or eliminating such limitations is set forth below. 

In a preferred embodiment the signal generated by the interaction between the unit 
and the agent results from fluorescence resonance energy transfer (FRET) between 
fluorophores. Either the unit or the proximate compound/agent may be labeled with either 
the donor or acceptor fluorophore. FRET is the transfer of photonic energy between 
fluorophores. FRET has promise as a tool in characterizing molecular detail because of its 
ability to measure distances between two points separated by 10 A to 100 A. The angstrom 
resolution of FRET has been used in many studies of molecular dynamics and biophysical 
phenomena (for reviews see Clegg, 1995; Clegg, 1992; Selvin, 1995; and Wu and Brand, 
1994). The resolving power of FRET arises because energy transfer between donor and 
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acceptor fluorophores is dependent on the inverse sixth power of the distance between the 
probes. In practice, this resolution is about an order of magnitude better than that of the 
highest resolution electron microscope. 

In order to undergo FRET, the emission spectrum of the donor overlaps with the 
excitation spectrum of the acceptor. The unit of the polymer is specifically labeled with an 
acceptor fluorophore. The agent is a donor fluorophore. A laser is tuned to the excitation 
wavelength of the donor fluorophore. As the polymer is moved through the channel, the 
donor fluorophore emits its characteristic wavelength. As the acceptor fluorophore moves 
into interactive proximity with the donor fluorophore, the acceptor fluorophore is excited by 
the energy from the donor fluorophore. The consequence of this interaction is that the 
emission of the donor fluorophore is quenched and that of the acceptor fluorophore is 
enhanced. 

In order to generate an optimal efficient FRET signal for detection, two conditions 
should be satisfied. The first condition is efficient donor emission in the absence of 
acceptors. The second is efficient generation of a change in either donor or acceptor 
emissions during FRET. 

In a model system, both the donor fluorophore and the acceptor fluorophore can give 
rise to photonic emissions indefinitely. In such circumstances, during energy transfer, the 
decrease in the donor emission is equal to the increase in the acceptor emission. In other 
words, the emission change is identical for both the donor and the acceptor. This is so 
because for every one donor emission quenching event, there is an equal and opposite 
acceptor emission event. Cantor and Schimmel, 1980 present an intuitive outline of this 
model system. Consider a system of two fluorophores, a donor and an acceptor. At a range 
within Forster distance, very weak coupling occurs. The interaction between the donor and 
the acceptor is summarized as: 

K T 

Dt + Aa ^ Da + A h 

K-T 

where D a and A a are the ground singlet states and Db and Ab are the first excited singlet states. 
kr and k.j are the forward and reverse rates of energy transfer. The above reaction states that 
for a given donor in the first excited state, it will rapidly lose energy by internal conversion 
until it reaches its ground state, D a . Since the acceptor absorption energies overlap with the 
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donor emission energies, very weak coupling occurs, allowing resonance energy transfer to 
take place. 

As a result of the resonance energy transfer, both the donor (D b ) and the acceptor (A b ) 
are in excited vibrational states. Vibrational relaxation rapidly brings these to their respective 
ground vibrational levels. Vibrational relaxation is very efficient, with the average lifetime 
of an excited vibrational state being only about 10" 15 s (Skoog et al., 1992). Internal 
conversion, on the other hand, for fluorescent molecules is about 10" 9 s. The difference in 
these rates means that even when k T is very efficient, the reverse reaction (t T ) is unlikely to 
occur. In this ideal system, there is thus a shift of the relative population of excited donors 
and acceptors. 

The above analysis sets forth a system utilizing a single acceptor and a single donor. 
The same type of interaction may be performed on a system which utilizes many donors and 
one acceptor. Whenever the acceptor is in its ground singlet state (A a ), energy transfer can 
occur. This means that for an acceptor with a lifetime of 1 ns, it can be excited 1 x 10 9 times 
per second, a very large rate compared to that of a donor fluorophore under standard 
illumination (-25,000 excitation events/s). Since the maximum number of excitations that an 
acceptor can undergo is much greater than that of one donor fluorophore, multiple donors can 
transfer their energy to one acceptor. 

Although acceptor excitation to higher energy levels (A x ) and simultaneous donor de- 
excitation events may be considered to be a potential problem resulting in the interference 
with signals generated in a system using multiple donors and a single acceptor the following 
analysis demonstrates that this is not the case. It is improbable that excitation to higher 
energy levels and simultaneous donor de-excitation events will interfere with signal 
generation. 

Acceptor excitation to higher levels means that there theoretically could be further 
excitation from the acceptor's first singlet state (A b ) to the acceptor's higher singlet states 
(A x ). If this were indeed possible, then the maximum number of excitation events in a given 
time increases because the effective lifetime is shorter. A shorter effective lifetime arises 
because the acceptor no longer has to de-excite to undergo energy transfer. Such transitions 
in the acceptor can only occur if the acceptor absorption energies for a transition were 
coincident with the donor emission energies, which is generally not the case for most 
molecules. As a consequence, energy transfer between the donors and the acceptor can only 
occur when the acceptor is in its ground singlet state. 
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In cases of simultaneous donor de-excitation, only one of the donors can transfer 
energy to the acceptor. This means that the other donor is not quenched and emits 
radiatively. If simultaneous events occurred frequently, then the above scenario for multi- 
donor quenching would not be as accurate because a fraction of the donors would be able to 
emit photons. The following probability calculations demonstrate that such events are 
statistically infrequent, supporting the use of multi-donor quenching. 

probability of k simultaneous donor 

emissions occurring in one ns, where n = P n ,k = (Pk)( n C k ) 

is the number of fluorophores 

k-n 

number of donor emissions = (2-l)P n , 2 t = (3-l)P„^t + ... +(n-l)P„,„t = 2 (k-l)P„, k t 
that cannot undergo energy k-2 
transfer 

The aim of these calculations is to calculate the number of simultaneous donor de- 
excitation events for a given amount of time. This number is important because the greater 
the number of overlapping events, the smaller the percentage of donor energy transfer. For 
example it is possible to find the probability of two donor emissions (k=2) simultaneously 
occurring in one nanosecond for four fluorophores (n=4; thus P n ,k = P4,2)- Each emission, in 
this example, lasts for one nanosecond and are represented as blue circles in the grids. In 
addition, each fluorophore emits an average of 25,000 photons/second randomly. The 
variables are defined as n = number of fluorophores, k = number of simultaneous donor 
emissions, P = probability, C = combination, and t = time. The probability for k 
simultaneous events occurring in one ns for n fluorophores (P n ,k) is equal to the probability 
for k simultaneous events (P k ) multiplied by the number of possible combinations that k 
simultaneous events can occur for n fluorophores ( n C k ). The total number of simultaneous 
events is given as the sum of probabilities for all possible values of simultaneous events. 
This gives the probability in the units of (number of events)/ns. The total number of 
fluorophores undergoing simultaneous emission is thus the probability for one ns multiplied 
by the given time (t). It is also necessary to consider the probability of greater than two or 
more donor emissions (P 4 ,3 and P^) occurring in a given time (t). From this information, the 
number of donor fluorophores that cannot undergo energy transfer because of simultaneous 
de-excitation can be derived. 

The probability for two simultaneous donor emissions in one nanosecond for four 
fluorophores is given as: 
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The probability that one ns will contain one emission is 25,000/1 0 9 . The probability that two 
emissions will occur in the same nanosecond is the square (25,000/1 0 9 ) 2 . Since there are four 

5 different fluorophores, there are 6 different possible combinations that give rise to 

simultaneous emission(4!2!2!). These values multiplied give 3.75 x 10" 9 events/ns. This 
value multiplied by 10 9 ns yields 3.75 events/s. In this case, there are two emissions/event 
(k=2) and one of these (k-1) cannot undergo energy transfer so the total number of donor 
emissions that do not undergo FRET is 3.75 events/s. 

10 The above simple calculation can be applied to a system with a greater number of 

fluorophores. In this case, one thousand fluorophores are considered (n=1000). In a similar 

fashion, P 10 oo,2 is calculated to be 3.122 x 10" 4 events/ns. The number of donor emissions that 

cannot undergo energy transfer is 3.122 x 10 5 . In some cases, three simultaneous donor 

emissions can occur. The probability for these events (P 1000,3) is calculated: 

,25000 , 3 ,1000! . 0 , nK rn . 6 

15 =( ) = 2.596x10 

^ ' 10 9 319971 

The number of donor emissions that cannot occur is (3-l)(10 9 )(2.596xl0" 6 ), or 5.192 x 10 3 . 
In a similar fashion, calculations for four simultaneous emissions or greater need to be 
included in the net number of donor fluorophores that cannot undergo energy transfer. This 
number is expressed as the following equation: 
20 k-n 

= (2-l)P n2 t = (3-l)P n . 3 t + ... +(n-l)P„.„t = 2 (k-l)P„. k t 

k-2 

Solving the above summations yields 3.174 x 10 5 donor emissions that do not 
25 undergo energy transfer. The donor emission is calculated from the following values: 25 000 
photons/s per fluorophore, 1000 fluorophores, and a 1 second time interval. This means that 
2.5 x 10 7 photons are emitted per second. The conclusion is that 98.7% of donor events can 
undergo energy transfer. 

The conclusion from the above discussion of a multiple donor/single acceptor system 
30 is that one acceptor can undergo energy transfer with multiple donors. The limitations that 
were considered were the ability to excite an acceptor to higher energy levels and also the 
limitations of simultaneous donor de-excitation. In donor emissions that occur concurrently, 
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only one of the de-excitations would be able to undergo energy transfer with the acceptor. If 
this occurred often, then the acceptor would not be able to undergo energy transfer with all 
donor emission events. Probability calculations show that for a one thousand donor/one 
acceptor system under standard epiillumination conditions, 98.7% of donor emission events 
can undergo energy transfer with an acceptor. The overall conclusion from these detailed 
analyses is that a multi-donor system/one acceptor system can indeed exist and that a single 
acceptor can undergo energy transfer with a large number of donor molecules. 

The following description sets forth how conditions can be optimized for the donor 
and the acceptor fluorophores of the multi-donor/one acceptor system. In order to optimize 
the system two potential problems which may arise when this system is actually performed 
should be considered. These are photobleaching and solvent quenching (Rost, 1990; Menter 
et al., 1978; Giloh and Sedat, 1982; Vaughan and Weber, 1970; Guilbault, 1973; Udenfriend 
1962; Pringsheim, 1963). These effects may limit the number of excitation cycles a 
fluorophore can undergo. Whereas in an ideal system a fluorophore can undergo infinite 
excitation cycles, in practice, the actual number of cycles is limited to a finite number 
depending on both the solvent conditions and intensity of the excitation light. The signal 
produced in a multiple donor/one acceptor system, however, can be optimized under certain 
experimental conditions. 

The theory for optimizing and generating an efficient signal follows. The rationale is 
that as long as model conditions are simulated as close as possible, signals are generated. 
Recall that ideal fluorophores can undergo infinite cycles of excitation. If realistic 
fluorophores behave in a similar fashion, then any number of fluorophores can be detected as 
long as temporal integration of photonic emissions is sufficient. One method for achieving 
close to ideal conditions is by immobilizing fluorophores in a solid medium. Embedding 
fluorophores in solid media can abolish both photobleaching and quenching (Haughland, 
1996; Garland and Moore, 1979; Rost, 1991) and that certain types of fluorophores are 
especially stable (such as ferulic acid in plant cell walls, uranyl glass, and Acriflavine) (Rost, 
1991). This will dramatically increase the number of possible excitation cycles. Despite 
being embedded in a solid medium, donor fluorophores can generate fluorescent signals for 
several hours (Rost, 1991) and can still undergo FRET with acceptors (Stryer, 1978; Cantor 
and Schimmel, 1980) and vise versa. Furthermore it is known that the acceptor can undergo 
resonance energy transfer with donors for several hours in solvent conditions (Wittwer, 
1997; Lee et al, 1994; Uchiyama et al., 1996; Livak et al., 1997). The ability for an acceptor 
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to quench a donor that has a stable signal translates into the ability to generate highly efficient 
signals. 

It has been shown that an acceptor can undergo energy transfer with a donor for an 
extended period of time in solvent conditions (Wittwer, 1997; Lee et al., 1994; Uchiyama et 
5 al., 1996; Livak et al., 1997). The evidence arises from experiments done on peptide and 
nucleic acid cleavage assays. In these assays, a particular substrate is labeled at either end 
with a donor and an acceptor fluorophore. Since the length of the substrate is within range of 
the Forster distance of the donor-acceptor pair, the fluorescence of the donor is quenched. 
Monitoring the donor fluorescence allows a light based assay of cleavage. In this manner, an 
10 increase in donor fluorescence is directly proportional to the cleavage activity of the 

particular enzyme. Assays of this type, commonly called fluorometric cleavage assays, have 
been used to study a number of systems including HIV proteases (Matayoshi et al., 1990; 
Toth and Marshall, 1990), neutral proteases (Ng et al., 1989), EcoRV restriction 
endonucleases (Erskine and Halford, 1994), PaeR7 endonuclease (Ghosh et al., 1994), DNA 
15 polymerase I 5'-3' exonuclease activity (Wittwer et al., 1997; Livak, 1997), thermolysin 

(Yang and Van Wart, 1994), and papain (Garcia-Echeverria and Rich, 1992). The relevance 
of these experiments is important because they contain evidence that an acceptor can quench 
a donor for extended periods of time. The controls in fluorometric cleavage assays include 
the monitoring of uncleaved substrates (in the absence of the cleavage enzymes) for the 
20 duration of the experiment. For especially lengthy experiments, these controls need to be 

monitored for an extended period of time. The lack of increase in donor fluorescence in these 
controls demonstrates directly that an acceptor can quench a donor for a significant length of 
time. 

The acceptor maintains its ability to transfer energy for an extended period of time 
25 because its energy states remain unchanged. Energy transfer occurs when the acceptor 
absorption energies are coincident with the donor emission energies. The result of energy 
transfer is a ground singlet state (A a ) to excited singlet state (A b ) conversion. The acceptor 
loses energy through internal conversion, either fluorescent or non-radiative. The studies 
cited above demonstrate that despite non-ideal experimental conditions, the acceptors 
30 maintain their relative energy levels and hence their ability to undergo energy transfer. 

The number of excitation cycles that a fluorophore generally undergoes in a solution 
is roughly 35,000 cycles (Rost, 1991). The number of fluorophore excitation cycles in 
solution, however, is limited by the solvent effects discussed above such as oxygen 
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quenching, collisional quenching, and excited state reactions. Solvent conditions can be 
adjusted so that the number of excitation cycles is optimized (Haughland, 1996). Various 
chemical methods are used to optimize conditions for detection of acceptor fluorescence. 
Sodium azide (NaN 3 ), sodium iodide (NaT), dithiothreitol (DTT), dithioerythritol (DTE), 

5 sodium dithionate, ^-propyl gallate, ascorbic acid, and polyvinyl alcohol (PVA) all have been 
found effective with various fluorophores (Bock et al, 1985; Johnson et al., 1982; Picciolo 
and Kaplan, 1984; Gill, 1979; Giloh and Sedat, 1982; Valnes and Brandtzaeg, 1985). b- 
mercaptoethanol, sodium nitroprusside, and incorporation of electron donors and molecules 
with SH groups have also been found to be effective (Franklin and Filion, 1985; Spatz and 

10 Grabig, 1983; Hamada and Fujita, 1983). In addition, commercial reagents for reducing 
solution fluorophore fading are available. SlowFade (Molecular Probes, OR) formulation 
reduces the fading rate of fluorescein to almost zero. Because it provides a nearly constant 
emission intensity from fluorescein, the SlowFade reagent is especially useful for quantitative 
measurements and applications that employ a confocal laser scanning microscope, in which 

15 the excitation intensities can be extreme and prolonged. For a system under the proposed 
optimized conditions, both initial donor emission and the change in donor emission in the 
presence of acceptors have efficiencies close to that of ideal conditions. 

A "detectable signal" as used herein is any type of signal which can be sensed by 
conventional technology. The signal produced depends on the type of agent or station as well 

20 as the unit and the proximate compound if present. In one embodiment the signal is 

electromagnetic radiation resulting from light emission by a labeled (intrinsic or extrinsic) 
unit of the polymer or by the proximate compound. In another embodiment the signal is 
fluorescence resulting from an interaction of a radioactive emission with a scintillation layer. 
The detected signals may be stored in a database for analysis. One method for analyzing the 

25 stored signals is by comparing the stored signals to a pattern of signals from another polymer 
to determine the relatedness of the two polymers. Another method for analysis of the 
detected signals is by comparing the detected signals to a known pattern of signals 
characteristic of a known polymer to determine the relatedness of the polymer being analyzed 
to the known polymer. Comparison of signals is discussed in more detail below. 

30 More than one detectable signal may be detected. For instance a first individual unit 

may interact with the agent or station to produce a first detectable signal and a second 
individual unit may interact with the agent or station to produce a second detectable signal 
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different from the first detectable signal. This enables more than one type of unit to be 
detected on a single polymer. 

The detectable signal is produced at a station. A "station" as used herein is a region 
where a portion of the polymer to be detected, e.g. the unit, is exposed to, in order to produce 

5 a signal or polymer dependent impulse. The station may be composed of any material 

including a gas. Preferably the station is a non-liquid material. "Non-liquid" has its ordinary 
meaning in the art. A liquid is a non-solid, non-gaseous material characterized by free 
movement of its constituent molecules among themselves but without the tendency to 
separate. In another preferred embodiment the station is a solid material. In one 

10 embodiment when the unit interacts with an agent the station is an interaction station. The 
station may also be a signal generation station, which is discussed in more detail below. As 
used herein an "interaction station or site" is a region where a unit of the polymer and the 
agent can be positioned with respect to one another in close enough proximity whereby they 
can interact. The interaction station for fluorophores, for example, is that region where they 

15 are close enough so that they energetically interact to produce a signal. 

The interaction station in a preferred embodiment is a region of a nanochannel where 
a localized agent, such as an acceptor fluorophore, attached to the wall forming the channel, 
can interact with a polymer passing through the channel. The point where the polymer passes 
the localized region of agent is the interaction station. As each labeled unit of the polymer 

20 passes by the agent a detectable signal is generated. The agent may be localized within the 
region of the channel in a variety of ways. For instance the agent may be embedded in the 
material that forms the wall of the channel or the agent may be attached to the surface of the 
wall material. Alternatively the agent may be a light source which is positioned a distance 
from the channel but which is capable of transporting light directly to a region of the channel 

25 through a waveguide. An apparatus may also be used in which multiple polymers are 

transported through multiple channels. These and other related embodiments of the invention 
are discussed in more detail below. The movement of the polymer may be assisted by the use 
of a groove or ring to guide the polymer. 

Other arrangements for creating interaction stations are embraced by the invention. 

30 For example, a polymer can be passed through a molecular motor tethered to the surface of a 
wall or embedded in a wall, thereby bringing units of the polymer sequentially to a specific 
location, preferably in interactive proximity to a proximate agent, thereby defining an 
interaction station. A molecular motor is a compound such as polymerase, helicase, or actin 
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which interacts with the polymer and is transported along the length of the polymer past each 
unit. Likewise, the polymer can be held from movement and a reader can be moved along 
the polymer, the reader having attached to it the agent. For instance the agent may be held 
within a scanning tip that is guided along the length of the polymer. Interaction stations then 
5 are created as the agent is moved into interactive proximity to each unit of the polymer. 

Once the signal is generated it can then be detected. The particular type of detection 
means will depend on the type of signal generated which of course will depend on the type of 
interaction which occurs between the unit and the agent. Many interactions involved in the 
method of the invention will produce an electromagnetic radiation signal. Many methods are 
10 known in the art for detecting electromagnetic radiation signals, including two- and three- 
dimensional imaging systems. 

Three-dimensional imaging systems for imaging fluorescence in biological systems 
has recently been described. Dickson et. al., have described three-dimensional imaging of 
single molecules in the pores of a gel. Dickson et. al., Science, 274:966-969 (1996). Dickson 
15 et al. examine the effects of confined environments on Brownian motion of single fluorescent 
molecules. Single molecules of free fluorescent dye and fluorescent dye bound to protein 
were trapped in polyacrylamide (PAA) gels. Nile red was incorporated into polyacrylamide 
gels having pores on the order of 2 nm. The fluorophore gels were excited with an 
evanescent wave generated by total internal reflection (TIR) to detect movement or 
20 immobilization of single molecules within the pores of the gels. The TIR method produces 
three-dimensional information about the molecules in the pores by using the exponential fall- 
off in excitation intensity to determine the distance of the fluorophore from the interface. The 
discrepancy between observed motion and theoretical Brownian motion was studied by both 
cases. The Brownian motion for free fluorophores was reduced by a factor of 10 . The 
25 labeled proteins remained completely stationary in space. By direct analogy, the spatial 
confinement of the nanochannels should limit or eliminate the Brownian motion of the 
labeled DNA in nanochannel FRET sequencing. This would allow a stable and predictable 
passage of the DNA through the nanochannels. 

An example of a microscopy system useful according to the methods of the invention 
30 is provided in Ishijima et al, Cell, 92:161-171 (1998). 

Two-dimensional imaging systems are important because they have, among other 
parameters, low noise, high quantum efficiency, proper pixel-to-image correlation, and 
efficient processing times. An example of a device useful for detecting signals is a two- 



-48- 

dimensional fluorescence imaging system which detects electromagnetic radiation in the 
fluorescent wavelength range. 

There are three categories of fluorescence imaging devices based on the type of 
fluorescence signal measured, either intensity, lifetime, or spectra. Intensity signals can be 

5 captured by a variety of methods including charge coupled device (CCD) camera, streak 
cameras, and silicon diode arrays. Of these methods, the most common is the CCD camera 
because of its wide commercial applications. The streak camera offers a superior temporal 
resolution down to the femtosecond. Silicon diode arrays have superior dynamic range, 
signal to noise ratios, and temporal resolution (common frame rates at 1000/s), but have 

10 larger picture elements (25 uM x 500 uM as opposed to 20 uM x 20 jjM for a CCD). Each of 
the devices may be used with the methods of the invention but CCD cameras are preferred. 

Lifetime and spectral imaging are performed using a combination of instruments 
including gated image intensifiers, pulsed lasers, and CCD cameras. Time-gated methods, 
which are lifetime-related techniques, involve temporally discriminating fluorescence signals 

15 from background and autofluorescence. Periasamy et. al., which is hereby incorporated by 
reference provides a review of Time-gated fluorescence microscopy for clinical imaging. 
Periasamy et. al., Microscopy and Analysis, 33-35 (1995). Lifetime imaging uses time-gating 
or phase-modulation techniques to determine fluorescence lifetime. Spectral imaging 
determines the emission spectrum at each pixel. Time-gated and lifetime imaging techniques 

20 offer many advantages to simple CCD imaging such as better signal-to-noise ratios and 
greater dye specificity and thus also are preferred devices for the detection method of the 
invention. 

The first type of imaging technique, intensity imaging generally involves the use of a 
CCD or ICCD camera to independently capture intensity signals. This is a desirable method 
25 for detecting signals according to the methods of the invention because of its simplicity. 
CCD and ICCD cameras can be readily purchased from a number of suppliers (i.e. 
Photometries, Hamamatsu, Princeton Instruments). 

CCDs are two-dimensional silicon matrices that have many light sensitive elements 
called pixels that can hold electronic charges generated by photon interaction. Exposure of 
30 the CCD to photonic fluorescent emission causes accretion of charge in the individual pixels. 
After an exposure is complete, entire rows of accumulated charge are transported towards a 
serial CCD register. In the serial register, individual pixel charge packets are transported to a 
read-out amplifier that creates a signal proportional to the amount of charge. Each row of the 
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CCD is read in a similar fashion until the image is successfully converted to a two- 
dimensional series of signals. Charge transfer between the CCD and the serial CCD register 
is extremely efficient (99.9999% efficient). The output amplifier creates a linear response to 
the measured analog signals and the image is digitized between 8 and 16 bits. 
5 Upon read-out of the image, there are two options that CCD cameras usually employ 

to modify images. These are subarray read-out and binning, or charge-grouping. A 
programmed CCD camera can selectively process pixels in a defined region of interest. The 
advantage of defining regions of interest is that it allows faster image read-out to occur. The 
time to process and digitize each pixel is fixed, so that smaller desired regions allow for 
10 higher frame rates. Smaller regions are possible from subarray read-out. For instance a 100 
x 100 pixel in a 1024 x 1024 pixel CCD may be read at a frame rate of 100 frames/s at a pixel 
read frequency of 1 MHZ. Binning, which is a combination of smaller pixels into one 
effective larger pixel for faster readout times, on the other hand, allows combination of 
charge from several pixels. During binning, the CCD operates at reduced resolution for 
15 increased dynamic range and a higher frame rate. The dynamic range is the ratio of the 
largest signal a CCD can detect linearly to the basal readout noise. For example, a 1024 x 
1 024 pixel CCD binned 4x4 yields a 256 x 256 image which has large pixels that are 
effectively 16 times larger than the unbinned version. The image is also read-out at 16 times 
the rate. The specific operational mode of the CCD is dictated by the components of the 
20 camera and a host computer. 

To understand the control of the CCD, there are three components of a typical CCD 
that need to be discussed, including the camera controller, electronics unit, and camera head. 
The controller acts as the intermediary processor of input and output between the host 
computer and the camera. The controller contains logic which causes the camera to act on 
25 certain host commands. In addition it also relays digitized pixel data to the computer. 

Internally, the controller has sequences necessary for coordination of the CCD phases and 
timing of the analog processing units. The electronics unit convert digital commands into 
active clocking signals and sequences. Furthermore, the unit contains the AID converter. 
Digitizers from 8 to 16 bits are commonly used. The camera head contains the CCD and 
30 often a cooling device. Liquid nitrogen or Peltier cooling are common. 

With a basic understanding of the operation of a CCD camera, performance is the 
most important consideration for the camera. Specifically, these parameters are noise, 
linearity, quantum efficiency, and temporal resolution. There are four components of noise 
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that are important. They are dark current, read noise, shot noise, and lamp noise. Dark 
current is the leakage current within the CCD. The charge accumulates even in the dark, 
hence the term. The lower the CCD temperature, the lower the dark current. Thermoelectric 
coolers or liquid nitrogen can lower the CCD temperature to reduce the dark current. 

5 Temperatures of - 1 20°C can reduce the dark current by several orders of magnitude. Multi- 
pin phasing (MPP) is a new CCD technology that can reduce dark current by a factor of 100 
or more. Read noise is the stochastic electronic charge generated at higher frame rates arising 
from the output preamplifier. As the collection rate increases, so does the read noise. Read 
noise can be significantly reduced when the temperature is lowered below -60°C. Shot noise, 

10 also commonly called photonic noise, arises because of the quantum nature of light. It is the 
square root of the photonic signal. At low light levels, the relative shot noise is high and 
often masks the desired signal. Lamp noise is due to intensity fluctuations in the illuminating 
source and can be controlled by using stable power supplies. Lamp noise is very often 
overlooked in quantitative applications of CCD cameras. 

15 Photometric linearity becomes especially important during quantitative fluorescence 

microscopy. For use in the methods of the invention, concern for linearity is not overly 
important because in general linear operators are not being applied to the CCD data. Since 
the charge generation mechanism of a CCD is intrinsically linear, the output signal is 
precisely proportional to the charge. The electronics between the CCD and the digitizer 

20 provide room for deviations. High scientific grade CCDs usually exhibit linearity with less 
than 0. 1% deviation. In most cases, the linearity is difficult to measure because it is more 
linear than techniques used to test it. 

Quantum efficiency is the fraction of photons that reach the CCD that interact to 
generate electron-hole pairs in the CCD. Quantum efficiencies range from 10% on low-grade 

25 CCDs to as high as 80% on back illuminated scientific grade CCDs. There are also spectral 
variances of the quantum efficiency, with peaks occurring usually in the visible wavelength. 
Back illuminated CCDs have much greater quantum efficiencies at all spectral positions. 

Temporal resolution of a CCD camera becomes important when discussing the 
possible frame rate of the camera. Exposure times vary for given applications and the 

30 exposure time can be adjusted by the number of fiuorophores used and the intensity of the 
excitation light. The limitation to the temporal resolution in a given CCD camera is a 
function of the analog-to-digital converter operating frequency and also the image size. For 
example, an A/D converter operating at 1 MHZ can read 100 frames of a 100 x 100 image in 
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one second. In addition, the higher the dynamic range desired, the slower the read frequency. 
For example, it is difficult for current CCD cameras digitizing at 14 bits per pixel to operate 
above 1 MHZ. Use of multiple serial register devices overcomes the temporal limitations 
imposed by one A/D converter. The speed increase with such a device is proportional to the 
5 number of available output channels. A 14 bit 100 x 100 pixel image acquired by a CCD 
camera with 4 output registers operating at 1 MHZ can acquire 400 frames per second. The 
maximum rate limitation that cannot be overcome is dictated by the pixel read-out times, 
which vary from 20 us to 500 ns. For a 100 x 100 pixel image, even the slowest pixel read- 
out time can allow 500 frames/s. Hence, read-out times are rarely a limitation on the frame 

10 rate, but rather it is the A/D converter that is limiting. As evidence of this, frame rates of 
2000/s have been possible with a 128 x 128 array (Ichikawa et al., 1992). 

A CCD camera which is particularly useful in the methods of the invention is a large 
pixel, low noise, and short scan time camera. Large pixel sizes have greater well capacities 
and allow for greater collection of photons and hence a greater maximal signal-to-noise ratio 

15 (SNR). The basis of intensity change discrimination relies on a large SNR, as is to be 

discussed shortly. Accordingly, low noise aids in improving the SNR. MPP type CCDs with 
smaller well capacity and lower dynamic range are not well suited. The temporal resolution 
is high from the short scan time, allowing a high volume data stream that should be storable 
in real time in the computer RAM or on the hard drive. The quantum efficiency should be 

20 reasonable. The linearity of the CCD is not critical, but assumed to be scientific grade 

(0.1%). The dynamic range is not critical because the donor fluorophores that are images are 
approximately the same intensity. 

An intensified CCDs (ICCDs) camera consists of a photocathode, microchannel plate 
(MCP), and a phosphor screen in addition to the CCD camera. Fluorescence light impinges 

25 on the photocathode, releasing photoelectrons into the MCP. The MCP is a secondary 

electron multiplier consisting of an array of millions of glass capillaries (channels) fused into 
the form of a thin disc. When photoelectrons are incident upon the channels, secondary 
electrons are produced. Passage of the electrons through the channels produces an 
amplification that is directly controlled as the gain of the instrument. Each channel of the 

30 MCP corresponds to a picture element. The output of the MCP is focused onto a phosphor 
screen where electrons exiting from the MCP strike. The optical image is reproduced on the 
phosphor screen and is captured by a CCD camera. 
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In some instances, it is preferred to use a ICCD. The addition of the image intensifier 
offers high speed gating and high gain. Gating is probably the most important aspect of the 
ICCD. It is the electronic shutter action produced by controlling the input voltage to the 
image intensifier. At present, time-resolved imaging methods are possible with nanosecond 

5 and sub-nanosecond gating times (Thomas et al., 1992). Gating allows for greater signal 
detection because of temporal discrimination of background signals. Examples of temporal 
discrimination and enhancement of signal-to-noise ratios is seen in time-gated fluorescence 
microscopy and the pulse method of fluorescence lifetime imaging microscopy. The gain of 
the image intensifier is used to increase the sensitivity of the camera and also serves as a tool 

10 in the phase-modulated method of fluorescence lifetime imaging spectroscopy. The 

electronic gain can be as high as 10,000:1. This allows faint optical signals to be amplified 
over read noise. 

Single photon events can be detected with an ICCD. Single fluorophore imaging, for 
instance, has been achieved by Sase et al., 1995 using a CCD camera, image intensifier, and 

15 an epifluorescence microscope. In "Real Time Imaging of Single Fluorophores on Moving 
Actin with an Epifluorescence Microscope," Sase et al. demonstrate that single fluorophores 
can be imaged in real time with a high detection efficiency. Other methods that have 
achieved solution single molecule sensitivity include fluorescence correlation spectroscopy 
(Eigen and Rigler, 1994; Kinjo and Rigler, 1995), far-field confocal microscopy (Nie et al., 

20 1994), cryogenic fluorescence spectroscopy (Kartha et al., 1995), single molecule photon 

burst counting (Haab and Mathies, 1995; Castro and Shera, 1995; Goodwin et al., 1995; Peck 
et al, 1989; Nguyen and Keller 1987; Lee et al., 1994; Chen and Dovichi, 1996; Shera et al, 
1990; Soper et al., 1992), two-photon excited fluorescence (Mertz, 1995), and 
electrochemical detection (Fan and Bard, 1995). 

25 A numerical SNR value can be calculated based on the desired aspects of a CCD 

camera. For example, a CCD that satisfies the criteria set out above is an EEV 05-20 CCD 



(Princeton Instruments, (Princeton, NJ) which has the specifications set forth in Table 1 : 



Table 1 


CCD format 


1152x770 


dynamic range, bits 


14 to 17 


pixel size, jam 


22.5 x 22.5 


non-linearity, % 


<1 (16 bits) 


fall well capacity, ke- 


500 


dark charge at -120°C, 
electrons/pixel-hour 


<1 


readout noise, e- 


4-6, 50 kHz 
22,500 kHz 


quantum eff., %peak 


40 



-53- 

In order to calculate a SNR, an equation that considers the various noise values is 
needed. In any detection system, there is always a basal level of noise that may hinder 
detection of signals that are weak and intermittent. Intuitively, the larger the desired signal, 
the less important the basal level of noise becomes. A larger signal and hence a larger SNR 
5 can be attained by increasing either the number of fluorophores or the excitation intensity. 
The equation for calculating the SNR is 

SNR= . PE 

<J( Nlho, + Ndark + Nlead + NLp) 

pE= IJptENG 
ho 

PE represents the number of photoelectrons emitted from the detector and is related to the 
10 efficiency (E) of the detector system. The higher the efficiency (E), the greater the number of 
photoelectrons emitted for a given number of photons. PE is also related to the number of 
fluorophores (N), pre-amplifier gain (G), integration time (t), intensity of light (I), molar 
extinction coefficient of the fluorophore (s), and a fluorescence constant specific for the 
chosen fluorophore (p). N S hot represents the noise due to random fluctuations of the 
15 fluorophore emission and is related to the magnitude of the signal produced. The larger the 
signal, the larger the shot noise. Nd ar k and N rea d are dark noise and read noise. Niamp is the 
noise generated from the illuminating source. It is important to note that Namp can have large 
influences in the SNR. An illumination source that has a 1% intensity fluctuation can have a 
SNR only as large as 100:1. In this case, the lamp noise is minimized by using a stable 
20 power supply. For example, commercially available (Uniphase) helium-neon lasers with 

modified power supplies have intensity fluctuations of less than 0.002% of the total intensity. 
Furthermore, a tungsten filament lamp can be equipped with a stable power supply so that the 
output of the bulb fluctuates less than 0.001%. Table 2 lists the values for the variables and 
provides the reason the particular values were chosen. 



Table 2 


variable 


value 


reason 


I 


30 W/cnV 1 


intensity of 2 W laser is given by P/A. A is the beam area. 
Intensity of laser is 64 W/cm 2 


6 


91,000 1/Mcm 


molar extinction coefficient for fluorescein 


P 


3.8x 10- zl Mcm J 


constant for fluorescence emission of fluorescein 


t 


0.010 s 


integration time for CCD operating at 100 frames/s 


E 


0.025 


collection efficiency of system including filters, objective, and 
quantumefficiency of CCD 


N 


1000 


reasonable number of donor fTuorophores/pixel 
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Table 2 


G 


60 


typical pre-amplifier gain; gain for ICCD can be as high as 
10,000:1 


h 


6.6261 x 10° 4 J s 


Planck's constant 


V 


6.1224xlO i4 xs" i 


c=vX; A.=490nm (excitation of fluorescein) 


N shot 


632 e 1 


N shot = P£ 


Niamp 


8e" x 


0.002% intensity fluctuation 


N re ad 


40 e" 1 


estimation for EEV 05-20 CCDat high readout speeds 


Ndark 


~0 


value for EEV 05-20 CCD at -120°C 


SNR 


631:1 


from the above equation 



A SNR of 63 1 : 1 is high. A high SNR ensures efficient detection of both intensity and 
intensity changes, or in other words, donor emission in the absence and presence of an 
acceptor. For example, a SNR of 63 1 : 1 means that there is a 66% confidence of detecting a 
5 0. 158%o (1/632 x 1 00) change in the signal. There is a 95% confidence in detecting a 0.3 16% 
change and a 99.9% confidence for a 0.475% change. The larger the intensity change, the 
greater chance of detection. A higher baseline SNR in the absence of any intensity changes 
allows for greater confidence intervals for a given percentage intensity change. In order to 
maximize the confidence of detection of a signal change, it is important to generate large 
10 percentage changes in the presence of acceptors.. 

One method for generating a large percentage change is to cluster multiple donor 
fluorophores around the interaction station, e.g., in a concentric ring which the polymer can 
pass through, ensuring that all donor fluorophores will undergo energy transfer with the 
acceptor. An appropriate width of such a concentric ring of donor fluorophores can be 
15 determined by the rate of emission of the donors and also the Forster distance of the donor- 
acceptor pair. Typical changes in intensity upon acceptor passage range from 30% to 50%, 
which correspond to an unequivocal 100% confidence. 

Before the CCD camera can process a signal each signal generated must be captured 
by pixels of the detector system. Each pixel should be capable of capturing a signal from a 
20 single interaction station and should have the ability to detect transient changes in the signal. 
The area of an interaction station, e.g., the localized region of agent on a nanochannel, 
detected by one pixel is determined by the pixel size on the detector, the magnification of 
the image, and also the diffraction limit of the wavelengths to be measured. The relationship 
between the measured area, detector pixel size, and magnification is given by the following 
25 equation. 
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The size of the area to be measured is given by a 2 where a is the edge length. The edge 
length of the detector pixel is given by d. The magnification is given by M. Using 
convention values of these variables, with d = 1 5 uM and M - 60x, the edge length of a 
measured area turns out to be 500 nm, well within range of the resolving power of a 500 nm 
5 wavelength signal. The resolving power for a microscope sample under epiillumination is 
presented in the following equation (Matsumoto, 1993). 



R f = 0.6U/NA 



10 Rf is the minimum distance between two bright points that can be resolved. A, is the 
wavelength of the donor fluorescent emission. NA is the numerical aperture of the 
microscope objective. The highest numerical aperture is desired because of the inverse 
relationship between NA and the resolving power. NA is given by the following classic 
criterion for lens selection (Taylor and Salmon, 1989): 

15 NA = nsin® 

n is the index of refraction of the immersion medium. It is often desirable to use a higher 
index medium such as oil. 9 is the angle between the optical axis and the greatest marginal 
ray entering the lens. For quality microscope objectives at high powers, the numerical 
aperture can be as high as 1 .4. From the above two equations, the resolving power for a 500 

20 nm emission signal and a 1 .4 NA lens becomes 2 1 8 nm. 

The conclusion from the above calculations is that two adjacent pixels on the 
detection unit can each detect signals from their respective interaction stations without 
confusion of the origins of the signals. By carefully calculating the magnification and the 
pixel size cross-interference between pixels can be avoided. For example if the smallest pixel 

25 size (d = 6 uM) and the largest magnification (M = lOOx), were used a 120 nm edge length 
on the detected area is achieved. Since the resolving power of the system remains constant at 
218 nm, interference of signals would exist. This can be avoided by performing calculations 
prior to defining the experimental set up. Hence, by adjusting the magnification and the pixel 
size on the detection system, an optimal number can be reached where the measured area 

30 matches the diffraction limit. This can be determined from a combination of the above two 
formulas by setting the edge length equal to or greater than the resolving power of the 
detection system: 
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2d/M>0.6WNA 

the density of the interaction stations can be varied so that only one interaction station 
corresponds to one pixel of the detector system. For example, if a 4 x 10 6 nanochannels/cm 2 
plate is used, 0.01 nanochannels (interaction stations) are found per pixel using a 60x 
5 magnification and a 15 uM pixel size. The interaction station density can be adjusted even 
lower, e.g., an average minimum inter-pore distance of 48 uM is also within the appropriate 
range. 

In addition to intensity imaging both time-gated fluorescence microscopy and 
fluorescence lifetime imaging may be used to detect signals according to the methods of the 

10 invention. Time-gated fluorescence microscopy and fluorescence lifetime imaging are more 
involved methods that have advantages such as temporal discrimination of fluorescent signals 
and better signal-to-noise ratios than intensity imaging. 

Time-gating is desirable if there is significant background scattering or 
autofluorescence. Background light scattering is a problem when the scattered wavelengths 

15 are equal to the emission wavelengths of the desired fluorophores. Scattering effects can be 
avoided by using a fluorophore with a large Stokes shift so that the scattered wavelengths are 
shorter than the detected wavelength. In this case, the scattered light can be eliminated by 
using optical filters. On the other hand, autofluorescence is a common problem affecting 
essentially all studies employing fluorescence microscopy. Autofluorescence can arise from 

20 solvents, solutes, and the optical components of the microscope system. Autofluorescence 
decreases the signal-to-noise ratio of detection. This is the case even though there have been 
many improvements in the various components of the detector systems (Periasamy and 
Herman, 1994). 

Time-gated fluorescence microscopy (TGFM) utilizes differing fluorescence lifetimes 
25 to make a distinction between autofluorescence and fluorescence. Lifetimes of fluorescent 
dyes can be chosen to be longer than that of autofluorescence. Short lived autofluorescence 
decays to zero in less than 1 us whereas long lived fluorescence can demonstrate lifetimes 
from 1 fxs to 10 ms (ex. europium chelates, lanthanide chelates). Excitation of the sample is 
done with a brief intensity pulse, shorter than the lifetime of either the autofluorescence or 
30 fluorescence. Exponential decay follows. If measurements are made only after the decay of 
the fluorescent signal, then the longer lived signals are measured with a greater sensitivity. 

An example of an imaging apparatus for TGFM is provided in Periasamy, 1995. A 
fluorescent microscope (Nikon) with epiillumination capabilities and a continuous wave 
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(CW) laser (Coherent or Spectra-Physics) are shown in the reference, emitting at the desired 
excitation wavelength. The excitation laser light is chopped by a chopper with a chopper 
control to create laser pulses with defined pulse widths. The intensity of the light is 
controlled using a variable neutral density filter (Omega Optics). To delay the time of 

5 measurement, a delay pulse generator is used to generate a signal for controlling a high 
frequency gated image intensifier (Hamamatsu) or a chopper in the emission light path. A 
CCD camera (Princeton Instruments or Photometries) is used to collect the signals. 

The fluorescence lifetime represents the average amount of time a molecule remains 
in the excited state prior to its return to ground state. There are two methods for the 

10 measurement of fluorescence lifetimes. These are the pulse method and the phase- 
modulation method (Lakowicz, 1986; McGown, 1989; Gratton and Limkema, 1983). In the 
pulse method, the sample is excited with a brief pulse of light and the time-dependent 
fluorescent decay is measured. In the phase-modulation method, the sample is excited with a 
sinusoidally modulated light. The phase shift and demodulation is used to calculate the 

15 lifetime. Until recently, lifetime measurements were only used with cuvette samples. In the 
past five years, there has been development of methods that combine microscopic two- 
dimensional resolution and high-resolution lifetime measurements (Rodgers and Firey, 1985; 
Wang et al, 1990; Morgan et al, 1990; Clegg et al., 1991; Lakowicz and Berndt, 1991; 
Buurman et al., 1992; van de Ven and Gratton, 1992; Oida et al., 1993). The development of 

20 fluorescence lifetime imaging microscopy (FLIM) has allowed the detailed study of location 
and environment of fluorescent labels in cells and other microscopic samples. In the 
following, the advantages, theory, and applications of FLIM are discussed. 

Fluorescence lifetime measurements have been used for a variety of reasons including 
specificity, sensitivity, quantitation, and high temporal resolution (Wang et al., 1996). 

25 Measurement of lifetimes provide high specificity because fluorescent molecules have 
distinct lifetimes. In comparison to absorption and emissions, lifetimes provide greater 
discrimination of molecules. Lifetimes can also be carried out on small amounts of 
molecules, leading to similar sensitivities as intensity measurements. Quantitation of 
molecules through lifetimes provide a true measurement because fluorescence lifetime is 

30 directly related to the fluorescence quantum yield of the fluorophore. Lastly, lifetimes can be 
used to detect temporal events that occur on the time scale of biomolecular processes, usually 
between a picosecond and a microsecond. 
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There are two methods of determining lifetimes. The pulse method is described first 
(Lakowicz, 1986). Consider an short pulse of light exciting a population of fluorophores. 
The fluorescent signal of the excited molecules decays with time in a first order manner, 
given as the following exponential function: 

5 I(t) = A^ x> 

a is an arbitrary constant, t is the time, and t is the fluorescent lifetime. Intuitively, the 
fluorescent lifetime is the time required for the intensity to decay to 1/e of the original value, 
a decay of 63%. One of the methods to experimentally measure the lifetime is to use a pulse- 
sampling method (Herman et al., 1996). Di and D 2 are collected on consecutive frames. 

10 Frames are analyzed with a lifetime equation and a Figure 2 and 2-D lifetime array is 

generated. Following each excitation event, the multichannel plate gated image intensifier 
(MCP-GII) attached to the CCD camera is turned on for a very brief interval (i.e. 4 ns) at 
some time interval (ti) after the exciting pulse. The emission is acquired on a CCD that is 
continually on. The identical process is repeated for a large number of times to capture a 

15 sufficient signal on one frame of the CCD. After a sufficient signal at ti is generated, the 
CCD is read out and the gate window with respect to the excitation pulse (t 2 ) is shifted and 
the whole process is repeated. Interpretation of the two frames with a pixel by pixel analysis 
gives the lifetime of the image at each point. 

Wang et al., 1996 describe the apparatus for pulse FLIM as consisting of five main 

20 components: 1) pulsed light source; 2) image detection system (gated image intensifier and 
CCD camera; 3) timing control unit; 4) and fluorescence microscope. The system is identical 
to the apparatus described for time-gated fluorescence microscopy (TGFM) described above, 
except for the pulsed light source which is a picosecond pulsed light source (Coherent) 
consisting of a mode-locked YAG laser, a dye laser with a third harmonic generator, and a 

25 cavity dumper. Picosecond pulses having tunable wavelengths from UV to IR. Rates from 
single shot to 76 MHZ are generated. 

The second method of fluorescence lifetime determination is by the phase-modulation 
method. Instead of using a pulsed light source of excitation, the method uses a light whose 
intensity is modulated sinusoidally. The emission of the sample, therefore, follows the same 

30 sinusoidal variations. The modulated emission is delayed in phase because of the excited 

lifetime of the fluorescent molecules. The magnitude of the phase shift (cp) is directly related 
to the lifetime of the fluorophore. Furthermore, there is demodulation of the emission. In 
other words, the amplitude of the final emission is smaller in amplitude than that of the 
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excitation light. Both the phase angle (q>) and the demodulation factor (m = BA/ba) are 
measured and used to calculate the phase (t p ) and modulation lifetimes (x m ) (Lakowicz, 
1986). 

_tan^ _sj[(l/ m 2 )-l 

Tp Tm 

CO CO 

5 In single exponential decay, t p = x m = t, the actual fluorescence lifetime. 

The basic theory behind phase modulation lifetime determination can be applied to a 
two-dimensional imaging system. The following description is based on from Gadella et al., 
1993 and Lakowicz and Szmacinski, 1996, each of which are hereby incorporated by 
reference. The method uses a gain-modulated image intensifier that generates an image that 
10 has an intensity related to the phase shift of the emission signal. With use of several phase- 
sensitive images collected with various electronic delays or phase shifts, it is possible to 
calculate the lifetime image of the object. To understand this further, it is necessary to 
present the equation that describes the time-averaged, phase-sensitive intensity from a certain 
position r: 

15 l(r, 0-D) = l 0 (r)[l + \/2 mD m(r) cos(0(r) - 0 D )J 

where r denotes the pixel position, 9d is the phase angle of the gain modulation signal, 0(r) is 
the phase of the emission, mo is the gain modulation of the detector, m(r) is the modulated 
amplitude of the emission, I 0 (r) is the original intensity of the pixel (which depends on 
concentration). The equation describes the intensity of a given pixel as a function of two 

20 controlled parameters (9 D and m D ) and three unknowns (I 0 (r), 9(r), m(r)). Recall that the 
lifetime (x) can be determined if either 9(r) or m(r) is known. Since there are three 
unknowns, at least three different images are needed to determine the lifetime of the 
specimen. By controlling the phase angle of the gain modulation signal (9d), a series of 
phase-sensitive images are generated and hence lifetimes can be determined. 

25 The apparatus for phase-modulation FLIM is described in Lakowicz and Szmacinski, 

1996. Excitation is provided by the output of a cavity-dumped laser, which is synchronously 
pumped by a mode-locked Nd:YAG laser. The excitation light is expanded by a laser beam 
expander. The gated image intensifier is positioned between the target and the CCD 'camera. 
The gain of image intensifier is modulated using output of a frequency synthesizer. A CCD 

30 camera captures the phase sensitive images. A computer with FLIM software processes the 
output to generate a lifetime image. 
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Other interactions involved in the method will produce a nuclear radiation signal. As 
a radiolabel on a polymer passes through the defined region of detection, such as the station, 
nuclear radiation is emitted, some of which will pass through the defined region of radiation 
detection. A detector of nuclear radiation is placed in proximity of the defined region of 
radiation detection to capture emitted radiation signals. Many methods of measuring nuclear 
radiation are known in the art including cloud and bubble chamber devices, constant current 
ion chambers, pulse counters, gas counters (i.e., Geiger-Muller counters), solid state detectors 
(surface barrier detectors, lithium-drifted detectors, intrinsic germanium detectors), 
scintillation counters, Cerenkov detectors, etc. 

Other types of signals generated are well known in the art and have many detections 
means which are known to those of skill in the art. Among these include opposing 
electrodes, magnetic resonance, and piezoelectric scanning tips. Opposing nanoelectrodes 
can function by measurement of capacitance changes. Two opposing electrodes create an 
area of energy storage, which is effectively between the two electrodes. It is known that the 
capacitance of two opposing electrodes change when different materials are placed between 
the electrodes. This value is known as a dielectric constant. Changes in the dielectric 
constant can be measured as a change in the voltage across the two electrodes. In the present 
example, different nucleotide bases or units of a polymer may give rise to different dielectric 
constants. The capacitance changes as the dielectric constant of the unit of the polymer per 
the equation: C=KC 0 , where K is the dielectric constant and C 0 is the capacitance in the 
absence of any bases. The voltage deflection of the nanoelectrodes is then outputted to a 
measuring device, recording changes in the signal with time. 

A nanosized NMR detection device can be constructed to detect the passage of 
specific spin-labeled polymer units. The nanosized NMR detection device consists of 
magnets which can be swept and a means of irradiating the polymer with electromagnetic 
energy of a constant frequency (This is identical to holding the magnetic field constant while 
the electromagnetic frequency is swept). When the magnetic field reaches the correct 
strength, the nuclei absorb energy and resonance occurs. This absorption causes a tiny 
electric current to flow in an antenna coil surrounding the sample. The signal is amplified 
and output to a recording device. For known labeled compounds, the time of detection is 
much faster than current means of NMR detection where a full spectra of the compound in 
question is required. Known labeled units of polymers have known chemical shifts in 
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particular regions, thereby eliminating the need to perform fall spectral sweeps, lowering the 
time of detection per base to micro or milliseconds. 

A nanoscale piezoelectric scanning tip can be used to read the different units of the 
polymer based on physical contact of the different polymer units with the tip. Depending on 
5 the size and shape of the polymer unit, different piezoelectric signals are generated, creating a 
series of unit dependent changes. Labels on units are physically different than native units 
and can create a ready means for detection via a piezoelectric scanning tip. Upon contact of a 
polymer unit with the tip, the piezoelectric crystals change and give rise to a current which is 
outputted to a detection device. The amplitude and duration of the current created by the 

10 interaction of the polymer unit and the tip is characteristic of the polymer unit. 

Optical detectable signals are generated, detected and stored in a database the signals 
can be analyzed to determine structural information about the polymer. The computer may 
be the same computer used to collect data about the polymers, or may be a separate computer 
dedicated to data analysis. A suitable computer system to implement the present invention 

15 typically includes an output device which displays information to a user, a main unit 

connected to the output device and an input device which receives input from a user. The 
main unit generally includes a processor connected to a memory system via an 
interconnection mechanism. The input device and output device also are connected to the 
processor and memory system via the interconnection mechanism. 

20 Nanochannels can be prepared by electroless deposition procedures which produce a 

metal fibril running the complete width of a polycarbonate template membrane. The 
membrane can also be produced such that both faces of the membrane are covered with thin 
metal films to produce a nanodisk electrode ensemble, one of the metal films can be removed 
to expose the surface of the membrane. The metal films can be removed to expose the 

25 surface of the membrane. These electrodes can be connected at their bases to a common 

current collector. This assembly is useful for examining changes in current as polymers flow 
through changes in conductance can be measured. The preparation of such plates is 
described in Martin, C.P.R., Science, 266:1961-1965 (1994). 

Computer programs for data analysis of the detected signals are readily available from 

30 CCD manufacturers. Such programs may be executed using a general purpose computer such 
as described below. For the methods of the invention, only operations on single pixels need 
to be performed (point operations). The complexity of the point operations depend on the 
method of imaging used. Intensity based imaging offers the fastest manipulation of data and 
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because only arithmetic is performed on the individual pixels. Regardless of the imaging 
technique (intensity, TGFM, or FLIM), the algorithms performed on each pixel in each 
method are considered low-level when compared to global, whole frame operations that need 
to be performed in certain more complex imaging situations. 

5 It should be understood that one or more output devices may be connected to the 

computer system. Example output devices include a cathode ray tube (CRT) display, liquid 
crystal displays (LCD), printers, communication devices such as a modem, and audio output. 
It should also be understood that one or more input devices may be connected to the 
computer system. Example input devices include a keyboard, keypad, track ball, mouse, pen 

10 and tablet, communication device, and data input devices such as sensors. It should be 
understood the invention is not limited to the particular input or output devices used in 
combination with the computer system or to those described herein. 

The computer system may be a general purpose computer system which is 
programmable using a high level computer programming language, such as AC@. The 

15 computer system may also be specially programmed, special purpose hardware. In a general 
purpose computer system, the processor is typically a commercially available processor, of 
which the series x86 processors, available from Intel, and similar devices from AMD and 
Cyrix, the 680X0 series microprocessors available from Motorola, the PowerPC 
microprocessor from IBM and the Alpha-series processors from Digital Equipment 

20 Corporation, are examples. Many other processors are available. Such a microprocessor 
executes a program called an operating system, of which WindowsNT, UNIX, DOS, VMS 
and OS8 are examples, which controls the execution of other computer programs and 
provides scheduling, debugging, input/output control, accounting, compilation, storage 
assignment, data management and memory management, and communication control and 

25 related services. The processor and operating system define a computer platform for which 
application programs in high-level programming languages are written. 

A memory system typically includes a computer readable and writeable nonvolatile 
recording medium, of which a magnetic disk, a flash memory and tape are examples. The 
disk may be removable, known as a floppy disk, or permanent, known as a hard drive. A disk 

30 has a number of tracks in which signals are stored, typically in binary form, i.e., a form 

interpreted as a sequence of one and zeros. Such signals may define an application program 
to be executed by the microprocessor, or information stored on the disk to be processed by 
the application program. Typically, in operation, the processor causes data to be read from 
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the nonvolatile recording medium into an integrated circuit memory element, which is 
typically a volatile, random access memory such as a dynamic random access memory 
(DRAM) or static memory (SRAM). The integrated circuit memory element allows for faster 
access to the information by the processor than does the disk. The processor generally 

5 manipulates the data within the integrated circuit memory and then copies the data to the disk 
when processing is completed. A variety of mechanisms are known for managing data 
movement between the disk and the integrated circuit memory element, and the invention is 
not limited thereto. It should also be understood that the invention is not limited to a 
particular memory system. 

10 It should be understood the invention is not limited to a particular computer platform, 

particular processor, or particular high-level programming language. Additionally, the 
computer system may be a multiprocessor computer system or may include multiple 
computers connected over a computer network. 

The data stored about the polymers may be stored in a database, or in a data file, in 

15 the memory system of the computer. The data for each polymer may be stored in the 

memory system so that it is accessible by the processor independently of the data for other 
polymers, for example by assigning a unique identifier to each polymer. 

The information contained in the data and how it is analyzed depends on the number 
and type of labeled units that were caused to interact with the agent to generate signals. For 

20 instance if every unit of a single polymer, each type of unit (e.g., all the A's of a nucleic acid) 
having a specific type of label, is labeled then it will be possible to determine from analysis 
of a single polymer the order of every unit within the polymer. If, however, only one of the 
four types of units of a polymer such as a nucleic acid is labeled then more data will be 
required to determine the complete sequence of the nucleic acid. Several labeling schemes 

25 and methods for analyzing using the computer system data produced by those schemes are 
described in more detail below. The labeling strategies are described with respect to nucleic 
acids for ease of discussion. Each of these strategies, however, is useful for labeling all 
polymers. 

Several different strategies of labeling are possible, involving permutations of 
30 different types of nucleotides labeled, different percentage of nucleotides labeled, and single- 
stranded or double-stranded labeling and diversity labels, such as compound which bind to a 
polymer having a specific sequence (diversity labels are discussed in more detail below 
relative to specific embodiments). The simplest labeling scheme involves the labeling of all 
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four nucleotides with different labels. Labeling schemes using three, two, or even one label 
are also possible. 

A four nucleotide labeling scheme can be created where the A's, C's, G's, and T's of 
a target DNA is labeled with different labels. Such a molecule, upon traversing an interaction 

5 station, will generate a linear order of signals which correspond to the linear sequence of 
nucleotides on the target DNA. The advantage of using a four nucleotide strategy is its ease 
of data interpretation and the fact that the entire sequence of units can be determined from a 
single labeled polymer. Adding extrinsic labels to all four bases, however, may cause steric 
hindrance problems. In order to reduce this problem the intrinsic properties of some or all of 

10 the nucleotides may be used to label the nucleotides. As discussed above, nucleotides are 
intrinsically labeled because each of the purines and pyrimidines have distinct absorption 
spectra properties. In each of the labeling schemes described herein the nucleotides may be 
either extrinsically or intrinsically labeled but it is preferred that at least some of the 
nucleotides are intrinsically labeled when the four nucleotide labeling method is used. It is 

15 also preferred that when extrinsic labels are used with the four nucleotide labeling scheme 
that the labels be small and neutral in charge to reduce steric hindrance. 

A three nucleotide labeling scheme in which three of the four nucleotides are labeled 
may also be performed. When only three of the four nucleotides are labeled analysis of the 
data generated by the methods of the invention is more complicated than when all four 

20 nucleotides are labeled. The data is more complicated because the number and position of 
the nucleotides of the fourth unlabeled type must be determined separately. One method for 
determining the number and position of the fourth nucleotide utilizes analysis of two different 
sets of labeled nucleic acid molecules. For instance, one nucleic acid molecule may be 
labeled with A, C, and G, and another with C, G, and T. Analysis of the linear order of 

25 labeled nucleotides from the two sets yields sequence data. The three nucleotides chosen for 
each set can have many different possibilities as long as the two sets contain all four labeled 
nucleotides. For example, the set ACG can be paired with a set of labeled CGT, ACT or 
AGT. 

The sequence including the fourth nucleotide also may be determined by using only a 
30 single labeled polymer rather then a set of at least two differently labeled polymers by 

identifying the position of the fourth nucleotide on the polymer. This can be accomplished 
by determining the distance between labeled nucleotides on a nucleic acid molecule. For 
example A, C, and G are labeled and the detectable signals generated indicated that the 
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nucleic acid molecule had a sequence of AGGCAAACG (SEQ. ID. No. 2). If the distances 
between each of the nucleotides in the nucleic acid molecule are equivalent to the known 
inter-nucleotide distance for a particular combination of nucleotides except the distance 
between G and G is twice the normal inter-nucleotide distance then a T is positioned between 
the two G's and the entire molecule has a sequence of AGTGCAAACG (SEQ. ID. No. 3). 
The distance between nucleotides can be determined in several ways. Firstly, the polymer 
and the agent may be moved relative to one another in a linear manner and at a constant rate 
of speed such that a single unit of the nucleic acid molecule will pass the agent at a single 
time interval. If two time intervals elapse between detectable signals then the unlabeled 
nucleotide which is not capable of producing a detectable signal is present within that 
position. This method of determining the distance between bases is discussed in more detail 
below in reference to random one base labeling. Alternatively the polymer and the agent may 
be caused to interact with one another such that each labeled unit interacts simultaneously 
with an agent to produce simultaneous detectable signals. Each detectable signal generated 
occurs at the point along the polymer where the unit is positioned. The distance between the 
detectable signals can be calculated directly to determine whether an unlabeled unit is 
positioned anywhere along the nucleic acid molecule. 

Nucleic acid molecules may also be labeled according to a two nucleotide labeling 
scheme. Six sets of two nucleotide labeled nucleic acid molecule can be used to resolve the 
data and interpret the nucleotide sequence. Ambrose et al., 1993 and Harding and Keller, 
1992 have demonstrated the synthesis of large fluorescent DNA molecules with two of the 
nucleotides completely extrinsically labeled. The average size of the molecules were 7 kb. 
Six different combinations of two nucleotide labeling are possible using the following 
formula: 

n! 4 _ 6 

(nC k!(n-k)! 2121 
where n nucleotides are taken k at a time. The possible combinations are AC, AG, AT, CG, 
CT, and GT. Knowledge of the linear order of the labels in each of the sets allows for 
successful reconstruction of the nucleic acid sequence. Using a 4-mer (5' ACGT 3') as a 
model sequence, the theory can be demonstrated. The first set, AC, gives the information 
that there must be a C after the A. This does not give information about the number of 
nucleotides intervening the A and the C nor does it give information about any G's or T's 
preceding the A. The second set, AG, shows that there is also a G after the A. Set AT shows 
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there is a T after the A. From these three sets, it is then known that the target DNA is a 4-mer 
and that one C, one G, and one T follow the A. The subsequent sets give information on the 
ordering of these three nucleotides following the A. Set CG shows that G follows C. Set CT 
shows that T follows C. Set GT finishes the arrangement to give the final deciphered 
sequence of 5' ACGT 3'. In addition to the method using six labeled sets of nucleic acid 
molecules, the sequence can be established by combing information about the distance 
between labeled nucleotides generating detectable signals as described above and information 
obtained from fewer than six sets of two nucleotide labeled nucleic acid molecules. 

A fourth labeling scheme, the random one nucleotide labeling scheme also may be 
used. In this method, distance information which is obtained by either population analysis 
and/or instantaneous rate of DNA movement is used to determine the number of nucleotides 
separating two labeled nucleotides. Analysis of four differently labeled target molecules 
yields the complete sequence. 

One method of analysis with these labeling methods includes the use of 
complementary base information. Figure 1 demonstrates the labeling strategy in which two 
differently labeled DNA samples are required. The first sample has two of its non- 
complementary bases randomly labeled with the same fluorophore. Non-complementary 
pairs of bases are AC, AG, TC, and TG. The second sample has one of its bases randomly 
labeled. The base chosen for the second sample can be any of the four bases. In the example 
given, the two non-complementary bases are chosen to be A and C. As a result, two samples 
are prepared, one with labeled A's and C's and another with labeled A's. The DNA is 
genomically digested, end-labeled, purified, and analyzed by nanochannel FRET sequencing. 
The sequence-specific FRET information arising from each fragment is sorted into one of 
two complementary strand groups. Sorting allows population analysis to determine the 
positions of all the desired bases. The figure illustrates the generation of sequence 
information from the sorted data. The first group of analyzed information yields the positions 
of all the A's and C's on one strand. The second group analyzed yields knowledge of all the 
A's and C's on one strand. The same procedure is applied to the complementary stand. 
Knowledge of the complementary strand's A's and C's is identical to knowledge of the T's 
and G's on the other stand. The result is sequence reconstruction. To cross-verify the 
sequence, the process can be repeated for the other pairs of non-complementary bases such as 
TG, TC and AG. 
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There are two methods of determining the distance between bases. One requires 
determining the instantaneous rate of DNA movement, which is readily calculated from the 
duration of energy transfer or quenching for a particular label. Another involves analyzing a 
population of target DNA molecules and its corresponding Gaussian distance distributions. 

The instantaneous rate method, involves a determination of distance separation based 
on the known instantaneous rate of DNA movement (v) multiplied by the time of separation 
between signals (t). Instantaneous rate is found by measuring the time that it takes for a 
labeled nucleotide to pass by the interaction station. Since the length of the concentrated area 
of agent (d) is known (through calibration and physical measurement of the localized region 
of the agent, e.g., the thickness of a concentrated donor fluorophore area), the rate is simply 
v=d/t. As shown in Figure 2 analysis of raw data demonstrating changes in energy emission 
patterns resulting from sequential detectable signals when plotted produces a curve which 
from left to right shows two energy intensity decreases, followed by two energy intensity 
increases. The plateau from the first energy intensity decrease (denoted ti) is double that of 
the second plateau (t 2 ). The length of the interaction station is given as 5 1 A. From this 
given information, the number of labeled nucleotides is known. Furthermore, the distance of 
separation of the two is determined by relating the rate of DNA movement to the time of the 
donor intensity plateaus. 

The number of labeled nucleotides is simply denoted by the number of intensity 
decreases. In Figure 2, there are two intensity decreases. Accordingly, there must be two 
detectable labels on the DNA. To determine the distance of base separation, it is necessary to 
know the instantaneous rate of DNA movement, which is found by knowing the time for one 
labeled nucleotide to cross the localized region of the agent and the length of the localized 
region of the agent. The length of the localized region of the agent is given as 5 1 A. The 
time for one labeled nucleotides crossing the localized region of the agent is bounded by the 
first intensity decrease and the first intensity increase (denoted as the gray shaded region, 7.5 
s). The rate of DNA movement is 6.8 A/s. The base separation is derived from the time 
separating the labeled nucleotides (ti = 5 s) multiplied by the rate (6.8 A), which is equal to 
10 base pairs. As a means of cross-verification, 5 1 A - t 2 v also yields the base separation. 

In the population method the entire population of labeled nucleotide is considered. 
Knowledge of the length of the localized region of the agent and instantaneous rate, as 
required for the rate method, is not necessary. Use of population analyses statistically 
eliminates the need for precision measurements on individual nucleic acid molecules. 
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An example of population analyses using five nucleic acid molecules each traversing 
a nanochannel is described below. Five molecules representing a population of identical 
DNA fragments are prepared. In a constant electric field, the time of detection between the 
first and second labeled nucleotide should be identical for all the DNA molecules. Under 
5 experimental conditions, these times differ slightly, leading to a Gaussian distribution of 
times. The peak of the Gaussian distribution is characteristic of the distance of separation (d) 
between two labeled nucleotides. 

An additional example utilizing a population of one nucleotide randomly labeled 
nucleic acid molecule (six molecules represent the population) further illustrates the concept 

10 of population analysis and the determination of distance information. The nucleic acid is 
end-labeled to provide a reference point. With enough nucleic acid molecules, the distance 
between any two A's can be determined. Two molecules, when considered as a sub- 
population, convey the base separation molecules, distributions of 4 and 6 base separations 
are created. Extending the same logic to rest of the population, the positions of all the A's on 

15 the DNA can be determined. The entire sequence is generated by repeating the process for 
the other three bases (C, G, and T). 

In addition to labeling all of one type of unit, it is possible to use a one-nucleotide 
labeling scheme where not every nucleotide of one type is labeled. An outline of this method 
is shown in Figure 3 which shows a representative population of random A-labeled fragments 

20 for a 16-mer with the sequence 5' ACGTACGTACGTACGT 3' (SEQ. ID. No. 1). Each 
individually labeled DNA molecule has half of its A's labeled in addition to 5' and 3' end 
labels. With a large population of randomly labeled fragments, the distance between every 
successive A on the target DNA can be found. The end labels serve to identify the distance 
between the ends of the DNA and the first A. Repeating the same analysis for the other 

25 nucleotides generates the sequence of the 1 6-mer. The advantages of using such a method 
includes lack of steric effects and ease of labeling. This type of labeling is referred to as 
random labeling. A polymer which is "randomly labeled" is one in which fewer than all of a 
particular type of unit are labeled. It is unknown which units of a particular type of a 
randomly labeled polymer are labeled. 

30 As mentioned briefly above, various combinations of the labeling schemes can be 

used together. In all of the methods listed above, either ordered linear information from 
signals or distance information between nucleotides is considered. These two aspects can be 
combined to give methods that rely on both ordered linear and distance information. For 
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example a random one nucleotide labeling strategy expanded to a random four nucleotide 
labeling strategy. Random four nucleotide labeling is where a fraction of all four nucleotides 
is labeled. A population of molecules have a fraction of the four nucleotides labeled. Each 
of the four nucleotides have a unique label. Analysis of a randomly labeled population 

5 results in generation of the sequence data. 

The use of double-stranded DNA allows for variations. A single-stranded two- 
nucleotide labeling scheme can be performed when two of the nucleotides on one strand of 
DNA are fully replaced by labeled nucleotides. To reduce the steric constraints imposed by 
two extrinsically labeled nucleotides while preserving the theory behind two-nucleotide 

10 labeling, it is possible to label one nucleotide fully on each of the complementary strands to 
achieve the same end. This method involves using double-stranded DNA in which each 
strand is labeled with a different label. Six differently labeled duplex DNA sets will produce 
a data set which is adequate to provide sequence information. Each complementary strand of 
DNA should have one of the nucleotides labeled. In each of the duplex DNA sets, the 

15 equivalent of two different nucleotides (possible combinations are AC, AG, AT, CG, CT, 
GT) are labeled. When both complementary strands have the adenines labeled, this is 
equivalent to the combination AT. In duplex two-nucleotide labeling, the advantage is that 
only one nucleotide on each strand is labeled, allowing longer labeled strands to be 
synthesized as compared to two-nucleotide labeling on single-stranded DNA. In practice, it 

20 has been shown that synthesis of DNA fragments with one nucleotide completely labeled can 
be achieved with lengths much greater than 10 kb (Ambrose et al., 1993; Harding and Keller, 
1992). 

One use for the methods of the invention is to determine the sequence of units within 
a polymer. Identifying the sequence of units of a polymer, such as a nucleic acid, is an 

25 important step in understanding the function of the polymer and determining the role of the 
polymer in a physiological environment such as a cell or tissue. The sequencing methods 
currently in use are slow and cumbersome. The methods of the invention are much quicker 
and generate significantly more sequence data in a very short period of time. 

Sequencing of a polymer may encompass the sequencing of the entire polymer or 

30 portions of the polymer or even the identification of an individual unit on the polymer. One 
method for identifying an individual unit of a polymer involves the steps of transiently 
exposing the individual unit of the polymer, the identity of which is unknown, to an agent 
selected from the group consisting of electromagnetic radiation, a quenching source and a 
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fluorescence excitation source to generate an interaction with a detectable signal 
characteristic of the individual unit, and detecting and distinguishing the signal from signals 
generated from adjacent signal generating units of the polymer as an indication of the identity 
of the individual unit. 

5 The individual unit is "transiently exposed" to the agent in order to produce a 

detectable signal characteristic of the individual unit. "Transiently exposed" as used herein 
means that the unit is positioned within interactive proximity of the agent for enough time to 
produce a signal and then is moved out of interactive proximity. The exact length of time 
required to produce a signal will depend on the individual unit and the agent involved but 

10 generally the amount of time is between one nanosecond and one second. 

The signal characteristic of the individual unit is distinguished from signals generated 
from adjacent signal generating units of the polymer. An "adjacent signal generating unit" is 
the unit nearest to the individual unit which when exposed to the agent produces a detectable 
signal. It is not necessarily the unit which is directly linked to the individual unit unless the 

15 unit which is directly linked is labeled (intrinsically or extrinsically) and produces a 
detectable signal. 

In the case when the agent is one or more ffuorophores the interactive proximity 
between the agent and the unit is the energy transfer proximity and the signal produced is 
fluorescence resonance energy transfer. "Energy transfer proximity" as used herein is the 

20 distance between the unit and the fluorophore which allows interaction between two 

complementary sources if one source overlaps with the absorption spectrum of the other 
source. "Fluorescence resonance energy transfer" as used herein is the transfer of photonic 
energy between ffuorophores with overlapping emission and absorption spectra. 

Another method for identifying an individual unit of a polymer involves the steps of 

25 transiently moving the individual unit of the polymer relative to a station, the identity of the 
individual unit being unknown, detecting a signal arising from a detectable physical change 
in the unit or the station, and distinguishing said signal from signals arising from exposure of 
adjacent signal generating units of the polymer to the station as an indication of the identity 
of the individual unit. 

30 Thus in one aspect, the methods of the invention can be used to identify one, some, or 

all of the units of the polymer. This is achieved by identifying the type of individual unit and 
its position on the backbone of the polymer by determining whether a signal detected at that 
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particular position on the backbone is characteristic of the presence of a particular labeled 
unit. 

The methods of the invention also are useful for identifying other structural properties 
of polymers. The structural information obtained by analyzing a polymer according to the 

5 methods of the invention may include the identification of characteristic properties of the 
polymer which (in turn) allows, for example, for the identification of the presence of a 
polymer in a sample or a determination of the relatedness of polymers, identification of the 
size of the polymer, identification of the proximity or distance between two or more 
individual units of a polymer, identification of the order of two or more individual units 

10 within a polymer, and/or identification of the general composition of the units of the polymer. 
Such characteristics are useful for a variety of purposes such as determining the presence or 
absence of a particular polymer in a sample. For instance when the polymer is a nucleic acid 
the methods of the invention may be used to determine whether a particular genetic sequence 
is expressed in a cell or tissue. The presence or absence of a particular sequence can be 

15 established by determining whether any polymers within the sample express a characteristic 
pattern of individual units which is only found in the polymer of interest i.e., by comparing 
the detected signals to a known pattern of signals characteristic of a known polymer to 
determine the relatedness of the polymer being analyzed to the known polymer. The entire 
sequence of the polymer of interest does not need to be determined in order to establish the 

20 presence or absence of the polymer in the sample. Similarly the methods may be useful for 
comparing the signals detected from one polymer to a pattern of signals from another 
polymer to determine the relatedness of the two polymers. 

The proximity of or distance between two individual units of a polymer may be 
determined according to the methods of the invention. It is important to be able to determine 

25 the proximity of or distance between two units for several reasons. Each unit of a polymer 
has a specific position along the backbone. The sequence of units serves as a blueprint for a 
known polymer. The distance between two or more units on an unknown polymer can be 
compared to the blueprint of a known polymer to determine whether they are related. 
Additionally the ability to determine the distance between two units is important for 

30 determining how many units, if any, are between the two units of interest. 

In order to determine the proximity of two individual units of a polymer of linked 
units the polymer is moved relative to a station, where each individual unit is exposed to the 
station to produce a characteristic signal arising from a detectable physical change in the unit 
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or station. Each of the characteristic signals generated is then detected and the amount of 
time elapsed between detecting each characteristic signal is measured as described above. 
The amount of time elapsed is indicative of the proximity of the two individual units. The 
station may be an interaction station and the unit may be exposed to an agent to produce an 
5 electromagnetic signal. 

A "signal characteristic of an interaction" as used herein is a signal which is expected 
to result from the interaction of the station and a specific labeled unit. The specific signal 
generated will depend on the type of station as well as the type of labeled unit. For instance 
if the station is an agent which is electromagnetic radiation and the labeled unit is a 

10 fluorophore then the interaction between the two will result in the emission of 

electromagnetic radiation by the fluorophore at a wavelength at which the fluorophore is 
known to emit. If the station is an agent which is a scintillation layer and the unit is 
radioactive then the interaction between the two will result in the emission of electromagnetic 
radiation in the form of fluorescence. 

15 It is possible to determine the order of the units of a polymer using the methods of the 

invention. In one aspect of the invention the order of two individual units of a polymer can 
be determined by moving the polymer linearly with respect to a station and exposing two of 
the individual units to the station to produce first and second detectable signals arising from 
physical changes in the station or the unit. The order of the signals is an indication of the 

20 order of the two individual units. 

The general composition of the units of the polymer may also be determined by the 
methods of the invention. For instance, if the polymer is a nucleic acid the methods of the 
invention can provide information on the percentage of purines vs. pyrimidines or the 
percentage of A, C, T, and G in the nucleic acid. 

25 Quantitative information on the size of the sample may also be determined by the 

methods of the invention. For instance, the size of a polymer can be determined by 
determining the number of individual units which make up the polymer. The number of units 
which make up the polymer is determined by measuring the amount of time that is required 
for the entire polymer to traverse past an agent at an interaction site in a linear manner and 

30 dividing that by the average length of time for an individual unit of that particular type of 
polymer to completely traverse past the site. 

In addition to information about a specific unit the methods of the invention may be 
used to identify greater than one unit at a time in order to provide information about a 
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polymer. In one aspect the method is carried out by providing a labeled polymer of linked 
units, detecting signals from labeled unit specific markers of less than all of the linked units, 
and storing a signature of the signals detected to analyze the polymer. In this aspect of the 
invention each unit of the labeled polymer may be labeled with a unit specific marker or less 

5 than all of the units may be labeled with a unit specific marker. 

This method is particularly useful for analyzing multiple units of a polymer at one 
time. This is accomplished by using a unit specific marker which is labeled and which 
interacts with more than one unit in a sequence specific manner. As used herein a "unit 
specific marker" is a compound which specifically interacts with one or more units of a 

10 polymer and is capable of identifying those units. For instance a unit specific marker for a 
nucleic acid molecule can be a labeled dimers, trimers, etc. which bind to a specific sequence 
of bases, such as TG, AG, ATC, etc. By identifying the presence or position of the labeled 
markers structural information about the polymer can be derived. For instance, the presence 
of the marker on a polymer can reveal the identity of the polymer. This enables the presence 

15 or absence of a polymer in a solution or mixture of polymers to be determined. The order, 
distance, number etc. of the markers on a polymer can provide information about the 
sequence or composition of a polymer. Other unit specific markers include but are not 
limited to sequence specific major and minor groove binders and intercallators, sequence 
specific DNA and peptide binding proteins, sequence specific peptide-nucleic acids, etc. 

20 Many such unit specific markers exist and are well known to those of skill in the art. 

This type of analysis can be used in one embodiment to identify DNA fragments by 
analyzing the hybridization patterns of multiple probes to individual fragments of polymers. 
The current state-of-the-art methods for hybridization analysis of DNA rely upon DNA chips. 
The methods of the invention is advantageous for a number of reasons. The number, type, 

25 order, and distance between the multiple probes bound to an unknown fragment of DNA can 
be determined. This information can be used to identify the number of differentially 
expressed genes unambiguously. Current hybridization approaches can only determine the 
type of probes bound to a given fragment. Furthermore, the methods of the invention are able 
to quantitate precisely the actual number of particular expressed genes. Current methods rely 

30 on quantitation of fluorescence intensities, which often give rise to errors due to non- 
linearities in the detection system. Given the great amount of information generated, the 
methods of the invention do not require a selection of expressed genes or unknown nucleic 
acids to be assayed. This is in contrast to the requirement of different DNA chips for 
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different genes, sets of expressed genes to be analyzed, and also different organisms. The 
methods of the invention can identify the unknown expressed genes by computer analysis of 
the hybridization patterns generated. The data obtained from linear analysis of the DNA 
probes are then matched with information in a database to determine the identity of the target 

5 DNA. The methods can thus analyze information from hybridization reactions, which can 
then be applied to diagnostics and determination of gene expression patterns. 

A "signature" as used herein is a sequence-specific signal arising from a labeled 
polymer. The signature includes information about the structure of the polymer. For 
instance, the signature of a polymer may be defined by a series of consecutive units or by 

10 specific units spaced a particular distance apart from one another. The signature of the 

polymer identifies the polymer. Signatures are useful for uniquely identifying fragments by 
identifying bases at certain positions along the length of a strand of DNA. The probability of 
knowing any one position is 1/4. Unambiguous identification of a fragment comes with 
roughly twenty positions identified (l/4 20 =9.1xl20" 13 ). For a fragment with 20 detected 

15 labels and 1 0% detection/labeling, the size of the fragment needs to be only 200 base pairs. 
The proposed read length is on the order of kilobases, which should unambiguously identify 
any fragment. The identification of fragments allows for grouping by similar sequences, 
making sequence reconstruction by population analysis possible. 

A preferred method of analysis of the invention involves the use of radioactively 

20 labeled polymers. The type of radioactive emission influences the type of detection device 
used. In general, there are three different types of nuclear emission including alpha, beta, and 
gamma radiation. Alpha emission cause extensive ionization in matter and permit individual 
counting by ionization chambers and proportional counters, but more interestingly, alpha 
emission interacting with matter may also cause molecular excitation, which can result in 

25 fluorescence. The fluorescence is referred to as scintillation. Beta decay which is weaker 
than alpha decay can be amplified to generate an adequate signal. Gamma radiation arises 
from internal conversion of excitation energy. Scintillation counting of gamma rays is 
efficient and produces a strong signal. Sodium iodide crystals fluoresce with incident gamma 
radiation. 

30 A "scintillation" layer or material as used herein is any type of material which 

fluoresces or emits light in response to excitation by nuclear radiation. Scintillation materials 
are well known in the art. Aromatic hydrocarbons which have resonance structures are 
excellent scintillators. Anthracene and stilbene fall into the category of such compounds. 
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Inorganic crystals are also known to fluoresce. In order for these compounds to luminesce, 
the inorganic crystals must have small amounts of impurities, which create energy levels 
between valence and conduction bands. Excitation and de-excitation can therefore occur. In 
many cases, the de-excitation can occur through phosphorescent photon emission, leading to 

5 a long lifetime of detection. Some common scintillators include Nal (Tl), ZnS (Ag), 
anthracene, stilbene, and plastic phosphors. 

Many methods of measuring nuclear radiation are known in the art and include 
devices such as cloud and bubble chamber devices, constant current ion chambers, pulse 
counters, gas counters (i.e., Geiger-Miiller counters), solid state detectors (surface barrier 

10 detectors, lithium-drifted detectors, intrinsic germanium detectors), scintillation counters, 
Cerenkov detectors, etc. 

Analysis of the radiolabeled polymers is identical to other means of generating 
polymer dependent impulses. For example, a sample with radiolabeled A's can be analyzed 
by the system to determine relative spacing of A's on a sample DNA. The time between 

15 detection of radiation signals is characteristic of the polymer analyzed. Analysis of four 
populations of labeled DNA (A's, C's, G's, T's) can yield the sequence of the polymer 
analyzed. The sequence of DNA can also be analyzed with a more complex scheme 
including analysis of a combination of dual labeled DNA and singly labeled DNA. Analysis 
of a A and C labeled fragment followed by analysis of a A labeled version of the same 

20 fragment yields knowledge of the positions of the A's and C's. The sequence is known if the 
procedure is repeated for the complementary strand. The system can further be used for 
analysis of polymer (polypeptide, RNA, carbohydrates, etc.), size, concentration, type, 
identity, presence, sequence and number. 

The methods described above can be performed on a single polymer or on more than 

25 one polymer in order to determine structural information about the polymer. The invention 
also encompasses the practice of the methods described above on multiple polymers. These 
methods and an apparatus for performing the methods of the invention simultaneously on a 
plurality of polymers are described in detail below. 

Some of the methods described above are based on an interaction involving energy 

30 transfer or quenching to produce a detectable signal. The involvement of energy transfer or 
quenching is described in either one of two limitations in these embodiments of the invention. 
One limitation is that the agent involved in the interaction is selected from the group 
consisting of electromagnetic radiation, a quenching source or a fluorescence excitation 
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source. The other limitation is that the detectable signal is an electromagnetic radiation 
signal. It should be apparent to one of ordinary skill in the art that each of the methods can 
encompass the other limitation instead of the one described and still encompass the notion of 
interaction involving energy transfer or quenching. For instance in addition to encompassing 

5 the method for analyzing a polymer by exposing the units of the polymer to the agent selected 
from the group consisting of electromagnetic radiation, a quenching source, and a 
fluorescence excitation source to produce a detectable signal, the invention in these 
embodiments also encompasses a method for analyzing a polymer by exposing the units of a 
polymer to an agent to produce a detectable electromagnetic signal. 

10 In addition to the methods involving energy transfer, quenching or electromagnetic 

radiation signals, the methods of the invention described above can be performed by 
detecting signals which arise from a detectable physical change in the unit of the polymer or 
the station. As used herein a "detectable physical change" in the unit of the polymer or the 
station is any type of change which occurs in the unit of the polymer or the station as a result 

15 of exposing the unit to the station. Once the unit is exposed to the station a detectable signal 
is created. The station may be for instance, an interaction station or a signal generation 
station, each of which is discussed in detail herein. The type of change that occurs in the 
station or the unit to produce the detectable signal depends on the type of station and the type 
of unit. Several examples of station-unit combinations which undergo a change to produce a 

20 detectable signal are discussed herein for exemplary purposes. Those of skill in the art will 
be able to derive other station-unit combinations that fall within the scope of the invention. 

When the interaction between the station and the unit of the polymer is based on 
energy transfer, either the unit or the station or both may physically change to produce a 
signal. In one embodiment the station may transfer energy to the unit causing the unit to emit 

25 an energy unit specific signal. The physical change which occurs in the unit results from the 
change in energy state. In another embodiment the unit may transfer energy to the station 
causing the station to emit a unit-specific signal resulting from the specific energy transfer. 
In still other embodiments a partner compound may cause the physical change which 
produces a signal. When the interaction occurs between a radioactive unit and a station the 

30 unit physically changes by releasing energy. 

Another aspect of the invention encompasses methods for analyzing a plurality of 
polymers. Each of the polymers is analyzed by sequentially detecting interactions of units of 
a plurality of polymers at a signal generation station. These methods include but are not 
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limited to a method for characterizing a test polymer, a method for sequencing a polymer, a 
method for determining the order of units of a polymer, a method for determining the 
distance between units of a polymer, and analyzing a set of polymers. These methods 
encompass but are not limited to interactions resulting from energy transfer or quenching. 

5 A method for characterizing a test polymer is performed by obtaining polymer 

dependent impulses for each of a plurality of polymers, comparing the polymer dependent 
impulses of the plurality of polymers, determining the relatedness of the polymers based upon 
similarities between the polymer dependent impulses of the polymers, and characterizing the 
test polymer based upon the polymer dependent impulses of related polymers. 

10 A "polymer dependent impulse" as used herein is a detectable physical quantity which 

transmits or conveys information about the structural characteristics of only a single unit of a 
polymer. The physical quantity may be in any form which is capable of being detected. For 
instance the physical quantity may be electromagnetic radiation, chemical conductance, 
electrical conductance, etc. The polymer dependent impulse may arise from energy transfer, 

15 quenching, changes in conductance, mechanical changes, resistance changes, or any other 
physical changes. Although the polymer dependent impulse is specific for a particular unit, a 
polymer having more than one of a particular labeled unit will have more than one identical 
polymer dependent impulse. Additionally, each unit of a specific type may give rise to 
different polymer dependent impulses if they have different labels. 

20 The method used for detecting the polymer dependent impulse depends on the type of 

physical quantity generated. For instance if the physical quantity is electromagnetic radiation 
then the polymer dependent impulse is optically detected. An "optically detectable" polymer 
dependent impulse as used herein is a light based signal in the form of electromagnetic 
radiation which can be detected by light detecting imaging systems. When the physical 

25 quantity is chemical conductance then the polymer dependent impulse is chemically detected. 
A "chemically detected" polymer dependent impulse is a signal in the form of a change in 
chemical concentration or charge such as an ion conductance which can be detected by 
standard means for measuring chemical conductance. If the physical quantity is an electrical 
signal then the polymer dependent impulse is in the form of a change in resistance or 

30 capacitance. 

As used herein the "relatedness of polymers" can be determined by identifying a 
characteristic pattern of a polymer which is unique to that polymer. For instance if the 
polymer is a nucleic acid then virtually any sequence of 1 0 contiguous nucleotides within the 
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polymer would be a unique characteristic of that nucleic acid molecule. Any other nucleic 
acid molecule which displayed an identical sequence of 10 nucleotides would be a related 
polymer. 

A "plurality of polymers" is at least two polymers. Preferably a plurality of polymers 

5 is at least 50 polymers and more preferably at least 1 00 polymers. 

The polymer dependent impulses may provide any type of structural information 
about the polymer. For instance these signals may provide the entire or portions of the entire 
sequence of the polymer, the order of polymer dependent impulses, or the time of separation 
between polymer dependent impulses as an indication of the distance between the units. 

10 The polymer dependent impulses are obtained by interaction which occurs between 

the unit of the polymer and the environment at a signal generation station. A "signal 
generation station" as used herein is a station that is an area where the unit interacts with the 
environment to generate a polymer dependent impulse. In some aspects of the invention the 
polymer dependent impulse results from contact in a defined area with an agent selected from 

15 the group consisting of electromagnetic radiation, a quenching source, and a fluorescence 
excitation source which can interact with the unit to produce a detectable signal. In other 
aspects the polymer dependent impulse results from contact in a defined area with a chemical 
environment which is capable of undergoing specific changes in conductance in response to 
an interaction with a molecule. As a molecule with a specific structure interacts with the 

20 chemical environment a change in conductance occurs. The change which is specific for the 
particular structure may be a temporal change, e.g., the length of time required for the 
conductance to change may be indicative that the interaction involves a specific structure or a 
physical change. For instance, the change in intensity of the interaction may be indicative of 
an interaction with a specific structure. In other aspects the polymer dependent impulse 

25 results from changes in capacitance or resistance caused by the movement of the unit between 
microelectrodes or nanoelectrodes positioned adjacent to the polymer unit. For instance the 
signal generation station may include microelectrodes or nanoelectrodes positioned on 
opposite sides of the polymer unit. The changes in resistance or conductance which occur as 
a result of the movement of the unit past the electrodes will be specific for the particular unit. 

30 A method for determining the distance between two individual units is also 

encompassed by the invention. In order to determine the distance between two individual 
units of a polymer of linked units the polymer is caused to pass linearly relative to an signal 
generation station and a polymer dependent impulse which is generated as each of the two 
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individual units passes by the signal generation station is measured. Each of the steps is then 
repeated for a plurality of similar polymers. A polymer is said to pass linearly relative to a 
signal generation station when each unit of the polymer passes sequentially by the signal 
generation station. 

5 Each of the steps is repeated for a plurality of similar polymers to produce a data set. 

The distance between the two individual units can then be determined based upon the 
information obtained from the plurality of similar polymers by analyzing the data set. 

The method also includes a method for identifying a quantity of polymers including a 
label. For instance, it is possible to determine the number of polymers having a specific unit 

10 or combination of units in a sample. In a sample of mRNA, for example, the number of a 
particular mRNA present in the sample can be determined. This is accomplished by 
identifying a pattern or signature characteristic of the desired mRNA molecule. The sample 
of RNA can then be analyzed according to the methods of the invention and the number of 
mRNA molecules having the specific pattern or signature can be determined. 

15 As used herein "similar polymers" are polymers which have at least one overlapping 

region. Similar polymers may be a homogeneous population of polymers or a heterogenous 
population of polymers. A "homogeneous population" of polymers as used herein is a group 
of identical polymers. A "heterogenous population" of similar polymers is a group of similar 
polymers which are not identical but which include at least one overlapping region of 

20 identical units. An overlapping region typically consists of at least 10 contiguous 

nucleotides. In some cases an overlapping region consists of at least 1 1, 12, 13, 14, 15, 16, 
17, 18, 19, 20, 21, or 22 contiguous nucleotides. 

A "plurality of similar polymers" is two or more similar polymers. Preferably a 
plurality of similar polymers is 50 or more similar polymers. More preferably a plurality of 

25 similar polymers is 100 or more similar polymers. 

A "data set" as used herein is a set of information defining the polymer dependent 
impulses generated by similar polymers. The data set is analyzed as discussed above and the 
method of analysis used depends on the type of labeling scheme used to generate the labeled 
polymers. Nucleic acid sequencing is a particularly preferred embodiment of the methods of 

30 the invention. 

Currently, less than 5% of the human genome has been sequenced. This translates 
into a small fraction of the ideal in human sequence knowledge, which is the sequence of all 
individuals. For an instance, for the human population, there are 1.4xl0 19 (5 billion people x 
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3xl0 9 bases/person). So far, only 2xl0~ 10 percent of all human genetic information is known. 

The rate of sequencing of the human genome by all world-wide efforts is roughly 3xl0 9 /15 

years, or 550,000 bases/day, at a cost of >$l/base. Sequencing by the methods of the 

invention described herein will constitute an inordinate breakthrough in the rate of 
5 sequencing. The predicted time to complete one human genome with one machine is ~15 

hours. Several dynamic arrays in parallel will be able to complete the sequence of one 

human genome in a fraction of an hour. 

A method for sequencing a polymer of linked units is also encompassed by the 

invention. The method is performed by obtaining polymer dependent impulses from each of 
10 a plurality of overlapping polymers, at least a portion of each of the polymers having a 

sequence of linked units identical to the other of the polymers, and comparing the polymer 

dependent impulses to obtain a sequence of linked units which is identical in the plurality of 

polymers. 

The plurality of overlapping polymers is a set of polymers in which each polymer has 
15 at least a portion of its sequence of linked units which is identical to the other polymers. The 

portion of sequence which is identical is referred to as the overlapping region and which 

includes at least ten contiguous units. 

In another aspect of the invention the order of units of a polymer of linked units can 

be determined by moving the polymer linearly relative to a signal generation station and 
20 measuring a polymer dependent impulse generated as each of two individual units, each 

giving rise to a characteristic polymer dependent impulse pass by the signal generation 

station. These steps are repeated for a plurality of similar polymers and the order of at least 

the two individual units is determined based upon the information obtained from the plurality 

of similar polymers. 

25 A method for analyzing a set of polymers, in which each of the polymers of the set is 

an individual polymer of linked units, is encompassed by the invention. The method involves 
the step of orienting the set of polymers parallel to one another, and detecting a polymer 
specific feature of the polymers. 

The set of polymers are oriented parallel to one another. The polymers may be 

30 oriented by any means which is capable of causing the polymers to be positioned parallel to 
one another. For instance an electric field may be applied to the polymers to cause them to 
be oriented in a parallel form. Preferably the orientation step is in a solution free of gel. 
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A "polymer specific feature" as used herein is any structural feature of polymer which 
relates to its sequence. For instance a polymer specific feature includes but is not limited to 
information about the polymer such as the length of the polymer, the order of linked units in 
the polymer, the distance between units of the polymer, the proximity of units in the polymer, 

5 the sequence of one, some or all of the units of the polymer, and the presence of the polymer. 

The step of detecting the polymer specific feature may be performed simultaneously 
for all of the polymers. This step encompasses the sequential detection of each of the units of 
all of the polymers. This can be accomplished by passing linearly each of the polymers 
relative to a plurality of signal generation stations, and detecting and distinguishing polymer 

10 dependent impulses generated as said polymers pass said signal generation stations. 

The invention also includes a method for analyzing a set of polymers, each polymer 
of the set being an individual polymer of linked units. The method is performed by orienting 
the set of polymers in an electric field, simultaneously moving the set of polymers through 
defined respective channels, and detecting a polymer specific feature as the polymers are 

15 moved through the channels. The step of simultaneously moving the set of polymers 

through respective channels is carried out by moving one polymer per channel such that each 
unit passes the station individually. More than one polymer may be in the channel at a time if 
the polymers are positioned in tandem and only one unit interacts with one station at a time. 
A "defined respective channel" as used herein is a channel in which the structure is 

20 determined before the polymer enters the channel such that the polymer will follow a defined 
path as it passes through the channel. Channels such as those found in a gel matrix are not 
defined respective channels. 

The methods of the invention may also be used to detect resonance energy transfer or 
quenching between two interactive partners capable of such transfer or quenching. As used 

25 herein resonance energy transfer (RET) is the transfer of photonic energy between two 
compounds with overlapping emission and absorption spectra. Fluorescence resonance 
energy transfer (FRET) is the transfer of photonic energy between fluorophores. The two 
interactive partners are any compounds which are capable of energy transfer or quenching 
i.e., light emissive compounds or quenchers. 

30 The method is performed by bringing the two partners in close enough proximity to 

permit such transfer or quenching, applying an agent to one of said partners, the agent 
selected from the group consisting of electromagnetic radiation, a quenching source and a 
fluorescence excitation source, shielding fluorescence resonance energy transfer and 
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quenching occurring from electromagnetic radiation emission and interaction between the 
partners with a material shield, and detecting the emitted electromagnetic radiation. 

As used herein a "material shield" is any material which prevents or limits energy 
transfer or quenching. Such materials include but are not limited to conductive materials, 

5 high index materials, and light impermeable materials. In a preferred embodiment the 

material shield is a conductive material shield. As used herein a "conductive material shield" 
is a material which is at least conductive enough to prevent energy transfer between donor 
and acceptor sources. 

Each of the above methods of the invention are useful for at least various aspects of 

10 sequencing polymers. Also all of the methods can be used with the various labeling schemes 
described with respect to the method of analyzing polymers. 

The methods of the invention can be accomplished using any device which produces a 
specific detectable signal for an individual unit of a polymer. One type of device which 
enables this type of analysis is one which promotes linear transfer of a polymer past an 

15 interaction station or a signal generation station. According to one aspect of the invention, an 
article of manufacture which is useful for performing the methods of the invention is 
provided. The article of manufacture includes a wall material having a surface defining a 
channel, an agent selected from the group consisting of an electromagnetic radiation source, a 
quenching source, a luminescent film layer, and a fluorescence excitation source, attached to 

20 the wall material adjacent to the channel, wherein the agent is close enough to the channel 
and is present in an amount sufficient to detectably interact with a partner compound selected 
from the group consisting of a light emissive compound and a quencher passing through the 
channel. 

A "wall material" as used herein is a solid or semi-solid barrier of any dimensions 
25 which is capable of supporting at least one channel. A semi-solid material is a self 

supporting material and may be for instance a gel material such as a polyacrylamide gel. For 
instance the wall material may be composed of a single support material which may be 
conducting or non-conducting, light permeable or light impermeable, clear or unclear. In 
some instances the agent is embedded within the wall material. In these instances the wall 
30 material can be solely or partially made of a non-conducting layer, a light permeable layer or 
a clear layer to allow the agent to be exposed to the channel formed in the wall material to 
allow signal generation. When the wall material is only partially made from these materials 
the remaining wall material may be made from a conducting, light impermeable or unclear 
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layer, which prevent signal generation. In some cases the wall material is made up of layers 
of different materials. For instance, the wall material may be made of a single conducting 
layer and a single non-conducting layer. Alternatively the wall material may be made of a 
single non-conducting layer surrounded by two conducing layers. Multiple layers and 

5 various combinations of materials are encompassed by the wall material of the invention. 

As used herein a "luminescent film layer" is a film which is naturally luminescent or 
made luminescent by some means of excitation or illumination, e.g., electrooptic thin films 
and high index films illuminated by internal reflection. 

A "conductive material" as used herein is a material which is at least conductive 

10 enough to prevent energy transfer between a donor and an acceptor. 

A "nonconductive material" as used herein is a material which conducts less than that 
amount that would allow energy transfer between a donor and an acceptor. 

A "light permeable material" as used herein is a material which is permeable to light 
of a wavelength produced by the specific electromagnetic radiation, quenching source, or the 

15 fluorescence excitation source being used. 

A "light impermeable material" as used herein is a material which is impermeable to 
light of a wavelength produced by the specific electromagnetic radiation, quenching source, 
or the fluorescence excitation source being used. 

A "channel" as used herein is a passageway through a medium through which a 

20 polymer can pass. The channel can have any dimensions as long as a polymer is capable of 
passing through it. For instance the channel may be an unbranched straight cylindrical 
channel or it may be a branched network of interconnected winding channels. Preferably the 
channel is a straight nanochannel or a microchannel. A "nanochannel" as used herein is a 
channel having dimensions on the order of nanometers. The average diameter of a 

25 nanochannel is between 1 nm and 999 nm. A "microchannel" as used herein is a channel 
having dimensions on the order of micrometers. The average diameter of a microchannel is 
between lum and 1mm. Preferred specifications and dimensions of channels useful 
according to the invention are set forth in detail below. In a preferred embodiment, the 
channel is fixed in the wall. 

30 An agent is attached to the wall material in such a manner that it will detectably 

interact with a partner compound by undergoing energy transfer or quenching with the 
partner light emissive compound which is passing through the channel of the wall material. 
In order to interact with the partner compound the agent can be positioned in close proximity 
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to the channel. For example, the agent may be attached to the inside of the channel, attached 
to the external surface of the wall material, attached to a concentrated region of the external 
surface of the wall material surrounding the rim of the channel, embedded within the wall 
material, or embedded in the form of a concentric ring in the wall material surrounding the 

5 channel. Optionally the agent may cover the entire surface of the wall material or may be 
embedded throughout the entire wall material. In order to improve signal generation when 
the agent is not localized, a mask may be used to cover some areas of the wall material such 
that only localized regions of agent are exposed. A "mask" as used herein is an object which 
has openings of any size or shape. More than one agent may be attached to the wall material 

10 in order to produce different signals when the agents are exposed to the partner agent. 

The agent may be attached to the surface of the wall material by any means of 
performing attachment known in the art. Examples of methods for conjugating biomaterials 
are presented in Hermanson, G. T., Bioconjugate Techniques, Academic Press, Inc., San 
Diego, 1996, which is hereby incorporated by reference. 

15 When the agent is attached to the surface of the wall material it may be attached 

directly to the wall material or it may be attached via a linker. A "linker" as used herein with 
respect to the attachment of the agent is a molecule that tethers a light emitting compound or 
a quenching compound to the wall material. Linkers are well known in the art. Commonly 
used linkers include alkanes of various lengths. 

20 The agent is attached to the wall material in an amount sufficient to detectably interact 

with a partner light emissive compound. As used herein a "partner light emissive compound" 
is a light emissive compound as defined above but which specifically interacts with and 
undergoes energy transfer or quenching when positioned in close proximity to the agent. The 
amount of partner light emissive compound and the amount of agent required will depend on 

25 the type of agent and light emissive compound used. 

Another example of an article of manufacture of the invention is a wall material 
having a surface defining a plurality of channels and a station attached to a discrete region of 
the wall material adjacent to at least one of the channels, wherein the station is close enough 
to the channel and is present in an amount sufficient to cause a signal to arise from a 

30 detectable physical change in a polymer of linked units passing through the channel or in the 
station as the polymer is exposed to the station. A "discrete region" of the wall material 
adjacent to at least one of the channels is a local area which is surrounded by wall material 
not having a station. The area surrounding the discrete region does not interact with the unit 
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to produce the same characteristic signal produced by the interaction of the unit with the 
station. The discrete region is positioned in or near the channel such that the station at the 
dictreet region is exposed to the unit as it traverses the channel. 

An additional article of manufacture of the invention is a wall material having a 

5 surface defining a channel and a plurality of stations each attached to a discrete region of the 
wall material adjacent to the channel, wherein the stations are close enough to the channel 
and are present in an amount sufficient to cause a signal to arise from a detectable physical 
change in a polymer of linked units passing through the channel or in the station as the 
polymer is exposed to the station. 

10 As used herein a "plurality of stations" is at least two stations. Preferably a plurality 

of stations is at least three stations. In another preferred embodiment a plurality of stations is 
at least five stations. 

In a preferred embodiment the article of manufacture is a nanochannel plate. The 

following description of an optimal design of a nanochannel plate having fluorophores 
15 embedded within the plate is provided for illustrative purposes only. The example describes 

methods for optimizing several aspects of the article of manufacture. The description is in no 

way limiting of the article of manufacture claimed herein. 

Several examples of nanochannel plates are presented in Figure 4. Figure 4A shows a 

nanochannel plate (60) having layers of conducting material (62) and non-conducting 
20 material (64). The channel (70) has a diameter that is sufficient to encompass the passage of 

double-stranded, labeled DNA in a linear fashion. In this example donor fluorophores (68) 

are embedded in the clear non-conducting material in a concentric ring around each channel. 

The remaining portion of the nanochannel plate is made up of a light impermeable material 

(66). 

25 Figures 4B, C, D, E, and K show a nanochannel plate having fluorophores (68) 

attached to the surface of the wall material surrounding the opening produced by the channel 
(70). As shown in Figures 4B and 4D the fluorophores may cover the entire surface of the 
wall material. The fluorophores may also be concentrated around the channel opening as 
shown in Figures 4C, E, and K rather than covering the entire surface. Additionally, the wall 

30 material supporting the fluorophores may be a conducting layer (62) such as that shown in 
Figures 4D and E, the wall material may be a light impermeable layer (78) as shown in 
Figure 4K or the wall material may be a support layer (72) as shown in Figures 4B and C. A 
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support layer may be any type of wall material including but not limited to conducting, non- 
conducting, clear, light permeable, and light impermeable. 

Figures 4F, G, H, I, and L show a nanochannel plate having fluorophores (68) 
embedded in the wall material surrounding the channel (70). Again the fluorophores may 
5 extend across the entire wall material (shown in Figure 4F) or may be concentrated around 
the channel as shown in Figures 4G, H, and I. In the embodiments shown in Figures 4F, G, 
H, I, and L the fluorophores are embedded in a layer of non-conducting material (62) or of a 
clear material (74) or of a light permeable material (79). The layer having the fluorophores 
embedded within it may form the surface of the wall material as shown in Figures 4F and G 

10 or may be sandwiched between other layers. For instance the non-conducting layer (64) in 
Figures 4H and I is sandwiched between two conducting layers (62). The light permeable 
layer (79) of Figure 4L is sandwiched between two light impermeable layers (78). In some 
cases the layers shown form the entire wall material. In other cases the layers maybe 
adjacent to or sandwiched between supporting layers as shown in Figure 41. 

15 Figure 4 J shows a nanochannel plate having fluorophores (68) attached to the surface 

of the wall material surrounding the opening formed by the channel (70). The material 
surrounding all of the exposed surfaces of the wall material, including the surface within the 
channel is a conducting material. 

Figures 4M and N show a nanochannel plate having a luminescent thin film (76) 

20 positioned within the wall material surrounding at least a portion of the channel (70). The 
luminescent thin film either forms the surface of the wall material and is adjacent to a light 
impermeable layer (78) as shown in Figure 4M or may be sandwiched between two light 
impermeable layers (78) as shown in Figure 4N. 

Figures 40 and P show a nanochannel plate having two layers of fluorophores (68) 

25 either embedded in the wall material or attached to the surface of the wall material. In Figure 
40 the fluorophore layers (68) are attached to the surface of a conducting material (62) on 
either side of the wall material and surrounding the openings formed by the channel. In 
Figure 4P the fluorophore layers (68) are embedded in two light permeable layers (79) which 
sandwich a light impermeable layer(78). 

30 A preferred method of the invention involves the analysis of radiolabeled polymers as 

discussed above. Preparation of radiolabeled polymers such as DNA (for example, with 32 P 
or 3 H) is known in the art. The following description represents one of the many possible 
embodiments of analyzing radiolabeled polymers according to the methods of the invention 
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(Figure 4Q and R). A radiolabeled nucleic acid molecule (160) is analyzed with a single 
fabricated multilayered nanochannel (162). The nanochannel is the diameter of the labeled 
nucleic acid molecule and is constructed to yield a defined region of detection. The 
exemplary nanochannel plates shown in Figure 4Q and R are a heterogeneous multilayered 
5 structure consisting of two radiation impermeable layers such as lead or Lucite films (164, 
166) and a film of lower density material (or scintillation layer) (168) (i.e., conventional 
polymers, polymethylmethacrylate, polystryrene, Teflon, etc.). The lead films in Figure 4Q 
sandwich the layer of lower density material and are of such thickness as to prevent passage 
of radiation. The lower density material permits passage of radiation, thereby creating a 

10 defined region of radiation detection. As the radiolabel on the DNA passes through the 

defined region of detection, nuclear radiation is emitted, some of which will pass through the 
defined region of radiation detection. Figure 4R shows a nanochannel plate having low 
density material (168) surrounding the opening formed by the channel. The material 
surrounding all of the exposed surfaces of the wall material including the surface within the 

1 5 channel is a radiation impermeable layer ( 1 65). 

In a related embodiment of analysis of radiolabeled nucleotides (Figure 5), a detection 
system based on scintillation counting and multiple nanochannels is presented. A 
nanochannel array (170) is fabricated as shown in Figure 5. Multiple multilayered channels 
(172) exist for parallel amplification of data output. Each individual channel consists of two 

20 nuclear radiation shielding layers (1 74) which shield nuclear radiation, and a scintillation 

layer (176) which is fluorescently excited with exposure to nuclear radiation. The individual 
channels are separated from each other by a nuclear radiation shielding material. The nuclear 
radiation is prevented from reaching the fluorescent detection system by overlaying with 
optical quality Lucite (or any other transparent material which prevents the passage of 

25 nuclear radiation). This allows only the fluorescent signal to reach the detection system. 

Each of the above described nanochannels is only an example. It is, therefore, 
anticipated that each of the limitations described with respect to these embodiments involving 
any one element or combinations of elements can be included in each nanochannel. 
Preparation of films having multiple layers of differing material have been described in the 

30 art, e.g., US Patent No. 5,462,467, Ferreira et. aL, Thin Solid Films 244:806-809 (1 994). 

In the example provided donor fluorophores are concentrated in a concentric ring 
around each channel in order to optimize the donor intensity. Intuitively, a concentric ring is 
preferred because the range of energy transfer is limited in part by a radial Forster distance. 
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Examining one channel in detail will illustrate the marked changes in donor intensities that 
can occur for the concentric ring configuration. Figure 6 shows a single channel (82) having 
the concentric ring configuration. The outermost concentric ring (86) marks the edge of the 
donor fluorophores situated around the channel. The middle concentric area (84) denotes the 

5 region where the rate of energy transfer from the donor to the acceptor is greater than the rate 
of emission of the donor fluorophores. 

The rationale for embedding the donor fluorophores in a solid medium is best 
understood by examining the mechanisms of photobleaching and quenching. The factors that 
may diminish a fluorescent signal include: photobleaching, temperature quenching, energy 

10 transfer quenching, collisional quenching, excited state reactions, and solvent effects. These 
mechanisms are similar in that they all arise from either collisional or bimolecular events. A 
solid medium is a physical barrier to these undesired molecular processes and is a means of 
isolation of the donor fluorophores. For example, the mechanism of photobleaching is due to 
a reaction between an excited fluorophore and oxygen to form a non-fluorescent product 

15 (Menter et al., 1978; Menter et al. 1979; Giloh and Sedat, 1982). A solid medium, especially 
one manufactured in an oxygen-free environment, prevents oxygen from reaching the 
embedded fluorophores, eliminating possible oxidative reactions (Garland and Moore, 1979; 
Rost, 1990). In order to understand the effect of a solid medium further, a brief summary of 
the other quenching effects is presented. Temperature quenching is a decrease in 

20 fluorescence as temperature increases. The degree of temperature dependence depends on 
the compound; it is usually about 1% change in quenching per degree Celsius (Guilbault, 
1973). The effect is believed to be due to increased molecular motion and increased 
frequency of collisions, resulting in increased probability of transition to the ground state 
before fluorescence can occur. Collisional quenching is a broad category consisting of many 

25 possible mechanisms involving direct contact between a fluorophore and another molecule 
(Lakowicz, 1983). Excited state reactions involve the reactive excited state of a fluorophore 
in a reaction with nearby solvent molecules and constitutes a subset of collisional quenching 
(Porter, 1967; Zweig, 1973). Solvent effects consists of solvent-fiuorophore collisions and 
interactions, including effects of hydrogen bonding, acid-base chemistry, and charge transfer 

30 interactions (Lakowicz, 1983). Energy transfer quenching is due to the effects of impurities 
on fluorophores via undesired FRET. From this short listing of photobleaching and 
quenching mechanisms, it is clear that molecular interactions with fluorophores are the main 
causes of diminished fluorophore emissions. A solid media thus eliminates these undesired 
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molecular interactions, collisions, and reactions by isolating the fluorophores and creating a 
physical barrier that prevents the entry of undesired quenching molecules. 

The equation defining energy transfer should be examined in order to determine the 
effect of a solid medium on energy transfer. The equation that needs to be considered 
5 describes the Forster distance (Selvin, 1995; Cantor and Schimmel, 1980; Wu and Brand, 
1994; Clegg et al., 1995; Fairclough and Cantor, 1978): 



J is the normalized spectral overlap of the donor emission and acceptor absorption, (j) D is the 
quantum efficiency (or quantum yield) for donor emission in the absence of acceptor (c))d is 

10 the number of photons emitted divided by number of photons absorbed), n is the index of 
refraction, and k 2 is the geometric factor related to the relative angle of the two transition 
dipoles. Embedding the donor fluorophore in a solid medium raises n, the index of 
refraction. In solvent systems, n is taken to be around 1 .35 (Selvin, 1995), slightly higher 
than the value for water (1 .33). Some typical values for solid media are 1 .46 for fused quartz, 

15 1 .52 for crown glass, and 1 .66 for dense flint glass. A ratio of the R Q 's for a solid medium 
and a solvent system can be used to determine the magnitude of the effect of changing the 
index of refraction: 



where the primed values are the ones for the solid medium and the unprimed ones are for a 
20 solvent system. Assuming n ' = 1 .5, the ratio becomes 0.93. The Forster distance thus only 
changes by 7%, a value that can be easily corrected for by using donor-acceptor pairs that 
have a higher original R 0 . 

Referring again to Figure 6, conceptually, without presenting equations, the rate of 
energy transfer (transfer) from the center of the channel falls off with increasing distance from 
25 the center (Figure 6). It is also known that the rate of donor emission (k emU ) is uniform over 
the complete area of donor fluorophores in the absence of acceptors (Figure 6). The area 
closest to the channel, and thus to the acceptor, is quenched completely. At all areas less than 
Requai, the donor fluorophore is quenched completely because ^transfer is greater than k emit . At 
areas greater than the Requai, the rate of donor emission is greater than the rate of energy 
30 transfer so quenching is incomplete. 
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If the concentric ring of donor fluorophores has a radius equal to Requai, then a 
hundred percent intensity change can occur for the donor fluorophores. This means that upon 
the exit of an acceptor labeled nucleotide through the channel, the acceptor is detected with 
perfect efficiency. Recalling that confidence intervals are related to the SNR of the system, 
5 the minimum signal to noise ratio needed to generate a 99.9% confidence interval for such a 
change is calculated to be 6:1. The SNRs for the detection systems proposed are greater than 
600:1. 

A quantitative explanation is presented at this time to calculate Rs qu ai. The equations 
for the rates of donor emission and energy transfer are presented. The rate of donor emission 
10 (kemtt) is as follows: 



where lis the intensity of light incident upon the donor fluorophores, e is the molar extinction 
coefficient of the donor fluorophore, r is constant for fluorescence emission (3.8 x 10" 21 mol 
cm 3 /L), iVis the number of donor fluorophores, h is Planck's constant (6.6261 x 10" 34 J s), 
15 and o is the frequency of excitation light. 

The rate of energy transfer (k lra nsfer) from many donors to one acceptors is derived 
from the original Forster rate equation (Forster, 1965) for one donor and one acceptor. The 
original equation is given as: 



20 where k is the rate of energy transfer from one donor to one acceptor, R 0 is the Forster 

distance, t is the fluorescence lifetime of the donor, and r is the distance from the donor to 
the acceptor. The derivation of a multi-donor system is straightforward and follows from the 
discussion of the multi-donor system described above. 



25 system. First, there is the saturation limit imposed by the lifetime of the acceptor. The 

acceptor is only able to be excited whenever it is in the ground singlet state. For an acceptor 
with a lifetime of 1 ns, the upper limit is 1 x 10 9 excitation events/second. This large 
saturation level is hardly a concern given the low rate of excitation for a single fluorophore 
(25,000 excitation events/second). The second limitation is the probability of simultaneous 

30 de-excitation of donor fluorophores. As calculated above only a very small number of 
simultaneous de-excitation events can occur. Since the acceptor is not saturated with 
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There are two limitations in the amount of energy transferred for a multi-donor 
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excitation events resulting from energy transfer and the probability of simultaneous donor de- 
excitation is small, the rate of energy transfer for a multi-donor system is directly 
proportional to the number of donor fluorophores (N): 

K transfer 

Td \ r ) 

With this equation, the radius at which ktransfer is equal to k emit (Re qu ai) is found by 
equating the two rate equations and solving for r. 
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Solve for r. 
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A numerical value for Re qua i can be calculated. Table 3 lists the values for the variables and 
the reason the particular value is chosen. 



Table 3 


variable 


value 


reason 


Ro 


60A 


range for Forster transfer is 20 A - 100 A 




1 x 10" & s/photon 


fluorescent lifetimes range from 1 ns - 20 ns 


h 


6.6261 x 10" J£f Js 


Planck's constant 


V 


6.1224xl0 14 xs- J 


c=vX; A,=490nm (excitation of fluorescein) 


1 


30 W/cm z 


intensity of 2 W laser is given by PI A. A=beam area 
(r=2mm). Intensity of laser is 64 W/cm 2 


s 


91,0001/Mcm 


molar extinction coefficient for fluorescein 


p 


3.8 x 10-21 Mcm J 


constant for fluorescence emission of fluorescein 


Requai 


350 A 


from above equation 



Requai is calculated to be 350 A, within an order of magnitude of the Forster distance. 
This means that a concentric ring of fluorophore around a channel with a radius equal to 350 
A will give rise to a hundred percent change in signal intensity upon the passage of an 
acceptor label. In practice, having such a large donor intensity decrease is unnecessary from 
both the standpoint of signal detection and the need to resolve adjacent bases. For example, 
with a SNR of 600:1, only a 0.50% intensity change will give rise to a 99.9% confidence 
interval. Resolution between adjacent bases is possible by looking at further decreases in the 
donor emission when there are two acceptors instead of one in the donor layer. If one 
acceptor already decreases the donor emission to zero, then an additional acceptor will not be 
detected because the donor emission cannot decrease further. This aspect is discussed in 
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detail below. An example of a method for constructing a concentric ring of donor 
fluorophores around each channel by using a photolabile protecting group and light 
diffraction is also provided below. 

Nanochannels having a channel diameter size of at least 1 nm are commercially 
5 obtainable in the form of polycarbonate filters from Poretics, Inc. and can be made on order 
by Corning Separations, Inc. There are several methods that can be used to create 
nanochannels of the desired diameter. 

One method for preparing a nanochannel plate is by a track-etch procedure which 
produces cylindrical pores of uniform diameter in a membrane material. Microporous and 

10 nanoporous polymeric membranes having pore diameters on the order of 10 nm and with 

pore densities approaching 10 9 pores per square centimeter can be prepared by the track-etch 
method (R.L. Fleischer, P.B. Price, R.M. Walker, Nuclear Tracks in Solids (Univ. of 
California Press, Berkeley, CA (1975)). The manufacture of pores via track-etch is a two 
step process. In the first step, thin polycarbonate (or other polymeric material) film is 

15 exposed to collimated, charged particles in a nuclear reactor. As these particles pass through 
the polycarbonate material, they leave sensitized tracks. The density of the tracks is 
controlled by varying the amount of time the film is in the reactor. In the second step, the 
tracks left by the particles are preferentially etched, or dissolved, into uniform, cylindrical 
channels. The diameters of the perforations can be controlled by the residence time of the 

20 etchant on the film. Many examples of methods for forming track etched membranes have 
been described in the art, e.g. European patent Application No. 83305268. 1 , Publication No. 
0109147, to Varian Associates, Inc., U.S. Patents Nos. 3,303,085; 3,662,178; 3,713,921; 
3,802,972; 3,852,134, 4,956,219, 5,462,467, 5,564,959 and 5,449,917, each of which is 
incorporated herein by reference. 

25 The commercially available membranes are generally prepared from polycarbonates 

or polyesters; however, a number of other materials are amenable to the track-etch process 
(Id.). For instance, other polymeric materials include but are not limited to polystyrenes, 
aromatic polyesters, polyolefins, including polyethylene, polyethylene terephthalate, 
polypropylene, vinyl plastics such as polyvinyl difluoride (PVDF), and cellulose esters such 

30 as cellulose nitrate, cellulose butyrate and cellulose acetate. If the nanochannel plate of the 
invention is prepared by a track-etch technique it may be formed from any material capable 
of being track etched can be used to form the track etched membrane. 



-93- 

Devices for performing bombardment of materials with high energy particles are well 
known in research and industry. The particles used to form the tracks may be generated by a 
charged particle accelerator, such as an electrostatic accelerator (e.g., a Van de Graaff 
accelerator or Tandem accelerator), a linear accelerator or a cyclic accelerator such as a 
5 cyclotron or any other means known in the art. 

Once the damaged track is formed in the film the channels or pores are formed by 
selectively etching the film with a gas or liquid. The residence time of the etchant determines 
the size of the channels. The track etched film is exposed to the etchant for sufficient time to 
generate channels that are sized to match the desired application, which varies depending on 
10 the type of polymer being analyzed and the type of analysis. The channel diameter can be 
measured using a scanning electron microscopy (SEM) according to methods disclosed in 
Basic Principles of Membrane Technology, M. Mulder, Klumer Academic, 1991. 

A second method of creating nanochannels of defined diameter is to use a 
combination of track-etching and surface coating. A polycarbonate membrane of a diameter 
1 5 greater than the desired nanochannel device is coated with a thin film of material with a 

defined thickness. The resulting structure is a polycarbonate membrane surface coated to the 
desired diameter. The first layer of thin film that is added to the nanochannel plate is a 
conducting layer. A conducting layer helps to resolve adjacent bases (which is discussed 
below in more detail). 

20 Thin layers of conducting polymers are added to the polycarbonate membrane 

through solvent deposition. Solvent deposition of conducting polymers have been described 

(Cheung et al., 1994; Fereira et al., 1994; Fereira and Rubner, 1995; Fou and Rubner, 1995). 

The following is excerpted from Fou and Rubner, 1995: 

We describe the solution chemistry and methodologies needed to utilize the 
25 layer-by- layer processing technique described... to manipulate conducting 

polymers such as polypyrrole and polyaniline into multilayer thin films with 
angstrom-level control over both film thickness and film architecture. 
Ultrathin films with conductivities over 300 S/cm can be made. The process 
involves the spontaneous adsorption of monolayers of electrically conductive 
30 polymers onto substrates from dilute solutions. Subsequent multilayer thin 

films are created by alternate deposition with soluble polyanions. The 
thickness of the thin films can be precisely controlled to the angstrom level and 
can range between 5 A and greater than 1000 A. The conductive polymers 
used are polypyrrole and polyaniline because these can be made extremely 
35 conductive (300 S/cm) for ultrathin layers (-50 A). 
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The advantage of using a solvent deposition method to create the desired 
nanochannels is two-fold. First, nanochannels of any particular diameter can be created with 
precision and accuracy. Second, since the first layer of the nanochannel plate is a conducting 
layer, it is only convenient to add it. An additional advantage, though not significant, is that 

5 addition of a thick layer of conducting material can lower the peak-to-valley distance of the 
polycarbonate surface, creating a more uniform surface. 

Another method for preparing a nanochannel device of the invention is through the 
production of arrays of carbon nanotubes. lijima demonstrated the production of multiple 
concentric cylindrical shells of hexagonally bonded carbon atoms which can serve as 

10 catalytic surfaces to confine species in a 1-dimensional space. lijima, Nature, 354:56 (1991), 
see also US Patent No. 4,424,054. 

Li, W. Z., et. al., has also reported a method for producing large areas of highly 
ordered, isolated long carbon nanotubes. The method is based on a chemical vapor 
deposition which utilizes mesoporous silica containing iron nanoparticles embedded in the 

1 5 pores rather than carbon black and graphite or silica covered with transition metal 

nanoparticles. The following method is disclosed in Li, W. Z. et al.: 

Mesoporous silica containing iron nanoparticles were prepared by sol- 
gel process from tetraethoxysilane (TEOS) hydrolysis in iron nitrate aqueous 
solution. Analytically pure TEOS (10 ml) was mixed with 10.4 ml of 

20 analytically pure ethyl alcohol and 0.1 M iron nitrate aqueous solution (1 1 .4 

ml) by magnetic stirring for ~ 30 min. A few drops of concentrated hydrogen 
fluoride (0.2 ml) were then added, and the mixture was stirred for 15 min. 
After gelation of the mixture, the gel was dried for 1 week at 60°C to remove 
the excess water and other solvents. The gel was then calcined 10 hours at 

25 450°C at 10- 2 torr. A silica network with relatively uniform pores was 

obtained with iron oxide nanoparticles embedded in the pores. The iron oxide 
nanoparticles were then reduced at 550°C in 180 torr of flowing 9% H2/N2 
(110 cm 3 /min) for 5 hours to obtain iron nanoparticles, which have a high 
catalytic activity. Subsequently, a mixture of 9% acetylene in nitrogen was 

30 introduced into the chamber at a flow rate of 1 10 cm 3 /min, and carbon 

nanotubes were formed on the substrate by deposition of carbon atoms 
obtained from decomposition of acetylene at 700°C. The samples were 
examined by a scanning electron microscope (SEM) (S-4200, Hitachi) before 
and after carbon deposition, and energy-dispersive x-ray spectra (EDX) were 

35 recorded by a SiLi detector attached to the SEM. To prepare a transmission 

electron microscope (TEM) specimen, the sample was ground in a mortar and 
suspended in ethanol; a drop was then placed on a holey carbon copper grid 
and examined in a JEM 200-cx microscope to characterize the carbon 
nanotubes. 



40 
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Additionally, nanochannels may be prepared from anodic porous alumina which is a 
packed array of columnar hexagonal cells with central, cylindrical, uniformly sized holes 
typically ranging from 4 to 200 nm in diameter. Membranes of this type are prepared 
electrochemically from Aluminum metal (A. Despic and V.P. Parkhutik, in Modern Aspects 

5 of Electrochemistry, J.O. Bockris, R.E. White, B.E. Conway, Eds. (Plenum, New York, 

1989), vol. 20, chap. 6.). Pore densities as high as 10 11 pores per square centimeter have been 
achieved (D. AlMawiawi,N. Coombs, M. Moskovits, J. Appl. Phys. 70, 4421 (1991)). 
Membranes having pore diameters as small as 5 nm have been prepared using this method 
(and it is believed that even smaller pores can be prepared). Martin, C.R., Science, 266: 1961 

10 (1994). 

Matsuda and Fukuda have described a modification of porous alumina membranes. 
The membranes which are a highly ordered metal (platinum and gold) nanohole array are 
prepared using a two-step replication of the honeycomb structure of anodic porous alumina. 
Preparation of the negative porous structure of porous alumina followed by the formation of 

15 the positive structure with metal results in a geometrical structure identical to that of anodic 
porous alumina. The method, therefore allows the preparation of the hole array of anodic 
porous alumina with desired materials other than alumina. Matsuda and Fukuda, Science, 
268:1466 (1995). Matsuda and Fukuda's procedure is summarized below. 

Anodic alumina was produced using a long period anodization of alumina on a 

20 substrate at 40 V which results in a minimum number of defects and dislocations, followed 
by removal of the aluminum substrate and the bottom part of the porous layer with saturated 
HgCl 2 . The material was then dipped in 5% (by weight) phosphoric acid solution at 30°C to 
adjust the pore diameter. A thin metal layer was deposited on the bottom of anodic porous 
alumina by vacuum deposition in order to create a catalyst or electrode for the subsequent 

25 metal-plating process. Generally the same metal as that used to create the scaffold is used for 
the evaporation. Metal methacrylate monomer containing a polymerization initiator such as 
5% (by weight) benzoyl peroxide was injected into the holes under vacuum conditions and 
was polymerized by ultraviolet irradiation. The alumina layer was then removed with NaOH 
to produce a negative porous alumina-type structure of poly(methyl methacrylate) (PMMA). 

30 A positive structure was formed from the PMMA mold by electroless deposition of platinum 
as follows. The negative type of PMMA was dipped into the electroless plating solution, 
causing metal deposition to start at the bottom part of the cylindrical structure and the metal 
to gradually fill the cavity of the PMMA to the top of the negative type of PMMA. 
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Alternatively a gold (Au) hole array was prepared using electrochemical deposition of Au 
into the microcavity of the PMMA under constant conditions. The porous metal was 
obtained by dissolving the PMMA with acetone. 

Other nanoporous materials which have been described in the art include a 
5 nanochannel array glass with pore densities as high as 3 X 10 10 pores per square centimeter 
(RJ. Tonucci, B. L. Justus, A.J. Campillo, C.E. Ford, Science 258, 783 (1992) and Pearson 
and Tonucci, Science, 270: 68 (1995)). Douglas et al. have shown that the nanoscopic pores 
in a protein derived from a bacterium can be used to transfer an image of these pores to an 
underlying substrate (Douglas, et. AL, Science 257: 642 (1992)). Finally, Ozin has discussed 

10 a wide variety of nanoporous solids that can be used as template materials (Ozin, G. Adv. 
Mater. 4:612 (1992)). Nishizawa et. al., describe the production of metal nanotubule 
membranes having radii as small as 0.8 nm (Nishizawa et. al., Science 268:700 (1995), 
describing nanotubules formed by plating gold onto the walls of pores in a commercially 
available polycarbonate filtration membrane (Poretics) containing cylindrical nanopores of 

15 uniform radius (25 nm, 6 X 10 8 pores per square centimeter) running through the complete 
thickness (6 um) of the membrane. "The thickness of the Au layers deposited on the pore 
walls can be controlled by varying the plating time. As a result, the inside radius of the Au 
nanotubules can be varied at will [as determined from measurements of gas (He) flux across 
the tubule-containing membrane.]" Berggren et. al., have demonstrated techniques for 

20 nanolithography using self-assembled monolayers and a beam of neutral inert gas. Berggren 
et. al., Science 269: 1255-1257 (1995). 

La Silva et al., describe a technique for fabricating simple metal structures with a 
scanning tunneling microscope (STM) which have dimensions of 10's to 100's of nanometers 
and are partially electrically isolated from their environment. The method is performed by 

25 depositing a very thin metal film on an insulating substrate, and using the tip to machine gaps 
through the film where lateral electrical insulation is desired. (Journal of Vacuum Science & 
Technology B, (1993) 11:1992-1999) 

The wall material may be constructed in a manner which is optimal for resolving 
adjacent units of the polymer. Since the purpose of the wall material is to provide an 

30 environment which is conducive to generating a signal, the materials used to prepare the wall 
material may be selected to aid in this process. For instance, the wall material surrounding 
the agent if the agent is embedded in the wall material preferably is a non-conducting or light 
permeable material. At least two other mechanisms for optimizing the wall material in an 
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apparatus having fluorophores embedded in the wall surrounding the channel, in order to 
resolve adjacent labeled bases include the use of thin conducting layers and controlling the 
radii of donor fluorophores around each channel. 

Conducting layers prevent Forster energy transfer through electrical shielding. Figure 
5 6 demonstrates a configuration of the conducting layers relative to the non-conducting layer 
which contain the donor fluorophores. The conducting "sandwich" creates a defined region 
where energy transfer can occur and this helps to optimize base resolution. Forster energy 
transfer arises because of an electrical dipole-dipole interaction between a donor and 
acceptor. The rationale for the use of conducting layers when the signal generated is 
10 dependnet on FRET is that FRET is electric field-dependent and thus placing an electric 
shield between the donor and acceptor will prevent energy transfer. It is necessary to 
understand fully the basis of energy transfer. Incident excitation light creates an electric field 
in the donor because the light induces transitions in the donor, or causes electrons in the 
donor to oscillate (Selvin, 1995). This creates an induced, electric dipole moment in the 
15 donor, which in turn, creates its own electric field. Energy transfer arises when an acceptor is 
placed in the donor's electric field. As a consequence, there are induced transitions in the 
acceptor which create an induced dipole moment, p A . The size of the dipole moment is 
related to the size of the donor electric field: p A =a A E D , where a A is the polarizability of the 
acceptor. The amount of energy absorbed by the acceptor is p A E D , = a A E D 2 , which translates 
20 into the 1 /R 6 dependence of the rate of energy transfer. 

The theory of electrical shielding is found in most introductory physics books. 
Purcell, 1985 provides a clear explanation. The potential function inside the box, \\i (x, y, z), 
must satisfy Laplace's equation, T 2 y = 0. Given the knowledge of conductors, the 
boundary of the conductor is an equipotential, meaning that \\f = \\f 0 a constant function 
25 everywhere on the surface of the conductor. An obvious solution to Laplace's equation is \|/ 
= \[/ 0 throughout the volume. According to the uniqueness theorem, there can only be one 
solution, meaning the answer is \\> = a constant. The electric field of a constant potential 
function is zero because E = -grad The electric field is thus zero everywhere inside the 
box. 

30 The model of a box with an open hole can be applied to a nanochannel plate because 

essentially a nanochannel plate is a conducting box with many "holes". Consider the Figure 
7 A and 7B. Figure 7A shows a complete nanochannel plate having nanchannels (90). The 
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layer of donor fluorophores (88) is enclosed by the conducting "box." In this manner, there 
cannot be energy transfer to the donor fluorophores from outside the box because of electrical 
shielding (92). Recall that the basis of energy transfer is electric. At the boundary condition 
of the nanochannel, which is the outer surface of the nanochannel plate, v|/ = \|/ 0 , an 
5 equipotential surface. A logical solution to Laplace's equation is that everywhere inside the 
boundary condition that , i|/= constant. From the uniqueness theorem, this is the only 
possible answer and thus everywhere inside the conducting nanochannel "box," E = 0 
because E = \|/-grad. The conclusion is that acceptor fluorophores cannot undergo energy 
transfer from outside the box. Even inside a nanochannel, as shown in Figure 7B, energy 
10 transfer is limited geometrically. At any position where the acceptor molecule is not in the 
plane of the donor fluorophores, the amount of energy transfer is limited. In Figure 7B, the 
amount of donor fluorophores that cannot undergo energy transfer is enclosed by an angle of 
9, which for all purposes is very large unless the acceptor fiuorophore is directly in the plane 
of the donor fluorophores, which is the desired configuration. 
15 The consequence of the conductive layers is that a sharp signal is created upon the 

passage of a labeled nucleotide through the nanochannel. Figure 8A and Figure 8B 
demonstrate signal generation upon passage of an acceptor label through the nanochannel. 
Figure 8 A shows an enlarged view of one nanochannel (98). Only part of the conducting 
layers (94) is shown. The light impermeable polycarbonate layer is not shown. An acceptor 
20 label (1 00) on a strand of DNA moves through the nanochannel from bottom to top, starting 
at position A and moving to position C. FRET can only occur at position B because the 
conducting layers shield any interaction of the donor fluorophores (96) with the acceptor at 
positions A and C. Figure 8B graphically illustrates the sharp changes in donor intensity as 
an acceptor label moves from positions A to C. The middle region on the graph denotes the 
25 level of the donor fluorophores. The outer regions on the graph denote the level of the 

conducting layer. At the interface of the conducting and donor fiuorophore layers, there are 
dramatic changes in donor intensity due to electrical shielding. 

It is clear that resolution between adjacent bases can be resolved with conducting 
layers. By creating a conducting "sandwich" where the thickness the donor fluorophores is 
30 less than the helical rise of B-DNA (3 .4 A), the desired resolution can be achieved. Thin 

films of this thickness can be constructed easily with plasma, solution, chemical vapor, or ion 
beam deposition methods (Spohr, 1990; Valiev, 1992; Konuma, 1992; Pauleau, 1995; Bruno 
et al., 1995; Dash, 1975; Stuart, 1983; Morosanu, 1990) However, it is not desirable to use a 
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donor fluorophore film less than 3.4 A, as will be explained under the next heading. In brief, 
donor fluorophores embedded in a thicker layer allows the measurement of instantaneous rate 
of movement of DNA. By measuring the time a labeled nucleotide spends in the thick layer 
and knowing the dimensions of the layer, the rate of DNA movement is known, which is 
5 important for determining distances between labeled nucleotides. 

Another example of a method for precisely resolving adjacent labeled bases is to 
control the radii of the donor fluorophores around each nanochannel. The amount of energy 
transferred for two acceptor labels in the presence of a concentric ring of donor fluorophores 
is greater than the energy transferred for one acceptor label. Detection of the difference in 
10 energy transferred for one and two acceptors allows the resolution of adjacent bases. In order 
for the donor fluorophores to be able to interact with more than one acceptor, the thickness of 
the donor fluorophores has to be greater than the helical pitch of DNA. Furthermore, the 
radii of the donor fluorophores must be greater than for one acceptor (see Figure 6). A 
radii greater than R« qua ] for one acceptor allows for further decreases in donor intensity in the 
1 5 presence of more than one acceptor. A radii at Re qU ai means that upon that passage of one 

acceptor, the donor intensity decrease is equal to 100%. In this case, passage of two adjacent 
acceptors gives the same detected signal as one acceptor. 

Figure 9 schematically demonstrates the passage of a two-base labeled strand of DNA 
(102) through a nanochannel (104) with the proper thickness and radii of donor fluorophores 
20 (108) sandwiched between conducting material (106). The positions labeled "A" through 
"D" correspond to the labels on the graph shown at the right of the illustrations. Initially, the 
acceptor labels on the DNA are at position A. Energy transfer is not possible at this position 
so donor intensity remains at a maximum. Further movement of the DNA allows one 
acceptor to undergo energy transfer with the donor fluorophores (B) and a sharp decrease in 
25 the donor intensity occurs. At position C, two fluorophores can undergo energy transfer 
yielding a further decrease in the donor intensity. Finally, the two acceptor labels exit the 
region of donor fluorophores, energy transfer is not longer possible, and the donor intensity 
returns to a maximum (D). 

The change in donor emission in the presence of one and two acceptors can be 
30 visually demonstrated without mathematical quantitation. Figure 10A illustrates the amount 
of change as volumes. The illustrations show the amount of energy transfer as a solid volume 
(110). The original donor intensity is represented as the volume in the shape of a disc (112). 
Figure 10B shows the change for one acceptor. The right illustration is for two acceptors. 
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The decay curves represent the rate of energy transfer with respect to radial distance as given 
by Forster's equation. The decay curve for two acceptors is roughly double that of one 
acceptor. The rates of donor emission and energy transfer are expressed in units of 1/s nm 2 . 
Integration of the rate of donor emission decrease over the surface area of energy transfer 
yields the net decrease in donor emission. This is represented as the shaded area under each 
decay curve integrated over 2 n. Knowing the original donor emission (shaded rectangular 
area integrated over 2 n), the percent decrease in donor emission can be found. 

Mathematically, the changes can be easily calculated. In both cases, the change is 
equal to the striped area (Figure 10) integrated over 2 n. To calculate, the rates of donor 
emission and energy transfer have to be expressed in the appropriate units of 1/s nm 2 . To do 
so, the density of the donor fluorophores, given as N/A, is to be used in the rate equations, 
where No is the number of fluorophores and A is the area which is occupied by the donor 
fluorophores. Accordingly, the rate of emission is given as: 

_ I4pn d 

kemit hvA 
The rate of energy transfer becomes: 




Where N A is the number of acceptors that can undergo energy transfer with the 
donors. The general equation for the striped areas integrated over 2 % follows: 

VCR 2 ** - Rl^eO K emit + f k lransfer rdrdO 

For the present calculations, R^nor = 55 nm, Nd =1000. The donor fluorophore density 
(Nr/A) becomes 0. 1 1 fluorophores/nm 2 . This value is not unreasonable because the size of 
the area of the largest possible fluorophore is 1 nm 2 . This means that the density is at least an 
order of magnitude lower than the highest possible fluorophore density. 

In order to solve the general equation, recall that Re qua i can be solved by setting kemit = 
ktransfer, resulting in the following: 

r - K° 

It is also important to know the original donor emission (E a ), given as: 

E 0 ~ kemil^l Rdonor ~ Rchamtel) 
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Using the values from table 3 and from above, it is possible to tabulate donor 
emission values for different numbers of acceptors that are in the position to undergo energy 
transfer (table 4). Values for Na = 1-5, 11-12 are calculated as examples. It is expected that 
the value of the first donor emission decrease be the greatest. Subsequent decreases become 

5 progressively smaller. When Re qU ai approaches Rdonor, it is expected that actual (not percent) 
change approaches zero. This is so because essentially the donor molecules are almost 
completely quenched. 

The percent changes and the signal-to-noise ratios determine the detection capability, 
not the absolute numerical changes in donor emission. As expected, the SNR decreases as 

10 the number of acceptors increase because there is greater donor quenching. Otherwise stated, 
the donor emission becomes smaller. This decrease in SNR is compensated by increasing 
percent changes from N A = 5 to 12. The confidence is calculated by using the SNR and the 
percent change. For example, the confidence for detecting the change from one to two 
acceptors uses the SNR for one acceptor and the percent change from one to two acceptors. 

15 In this case, there is a 95% confidence for detecting a 0.483% change. Since the percent 

change is high, 29.1%, the signal change from one to two acceptors is detected with a 100% 
confidence. The calculations from above demonstrate that multiple adjacent acceptors can be 
detected with high efficiency. 



Table 4 


N A 


donor emission 


% change from 


SNR (80% full well 


confidence 




(photons/s) 


previous 


capacity at Eo, Na=0 




0 


2.5572x 10' 




632:1 


-100% 


1 


1.0943 x 10' 


57.2% 


413:1 




2 


7.7510 x 10° 


29.1% 


348:1 




3 


5.7650 x 10° 


25.6% 


300:1 




4 


4.3560 x 10° 


24.4% 


261:1 




5 


3.2440 x 10° 


25.5% 


225:1 














11 


3.8200 x 10 3 




77:1 




12 


2.1200 xlO 3 


44.5% 


58:1 





20 

An assumption made in the above calculations is that the donor emission remains 
constant during the passage of one fluorophore through the donor fiuorophore layer. Recall 
that the donor fluorophore layer may consist of a monolayer of fluorophores embedded in a 
clear non-conducting medium. The range of energy transfer for an acceptor close to the exit 
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of the nanochannel is less than the range upon initial entry into the channel. This change is in 
fact significant, as shown mathematically below. Taking into account the change yields 
higher signal-to-noise ratios and greater changes in donor intensity for additional acceptors. 
This means that the values in table 4 which show very efficient signal generation/detection 
already are actually even slightly higher. 

The change in donor emission upon passage of an acceptor through the donor 
fluorophore layer is determined by calculation in the following formulas. It is expected that 
the amount of energy transfer decreases as an acceptor passes through the donor layer due to 
a smaller effective energy transfer range. The consequence of this decrease is that the donor 
emission is greater than in the previous calculations. A higher donor emission means a 
higher SNR. Suppose an acceptor has entered the donor layer a small distance. A short time 
later, another acceptor enters the donor layer. The presence of an additional acceptor yields a 
decrease in the donor emission. The percentage change is large. In fact, it is larger than the 
previous calculations. The combination of higher SNRs and greater percentage changes 
mean that detection efficiencies are greater than those previously estimated. Complex 
patterns of labeling, such as labeling every several nucleotides, can be distinguished using 
this system. 

The equation of donor emission when the acceptor is on the same plane as the donor 
fluorophores is given as: 

k emit n(R 2 donor - R 2 hannel ) - [k emit n(R 2 equal - R 2 channe! ) + C don ° r \ Rmml rdrdO] 
Recall the equations for kemit and k tra nsfer: 

_ I&N D _ N D ,R 0 )6 

Kemit ; ' Ktransfer . ( / 

hvA r D A r 

Express the original donor emission function in terms of the radial distance (x) and 
distance of acceptor from donor layer (d) with the following substitutions: 



dr= X = dx r = ^lx 2 + d 2 

4x 2 +d 2 

The resulting equation, together with the substitution of the equation for k^ngf^ yields: 

kemu X( Rdonor " Rchannel) " [ k em U M xlqual ~ R-lhannel) + (2it —^f) J ( dx 

r D A x f^ (x +d ) 

Let u= x 2 + d 2 . It follows that dx = du/dx. 



k em uMR 2 donor-Rlhar m el)-[k em uMxl qm l-Rl ham el) + 2n^^ J ( du] 



kemn K(R 2 d onor - Runnel) " [ k em n M ' X^uai " Runnel) + ^f^"— 1 



tdA 4( x 2 equal + d 2 f 4(Ri„ or + d 2 f 
Substitute x equal = VR 2 equ ai - d 2 and the resulting emission function becomes: 

N R 6 1 1 

5 E(d) = k eml MRlonor ~ Rlannel) " [ k emit K( R^ual ~ d' " R^annel) + (27C—^)( — -— —f^)J 

TDA 4 Requal 4( R donor + d ) 

The resuling donor emission function can be plotted versus distance. 



Distance 


Donor Emission 
(photons/s) 


% Change from Original 


0 


1.09425 x 10' 




20A 


1.09740 x 10' 


0.023 % 


40A 


1.0692x10' 


1.158% 


60A 


1.12278 xl0 y 


2.607 % 


80A 


1.14501 x 10' 


4.638 % 


100A 


1.17365 x 10' 


7.256 % 



N D R 6 1 

El (d) = k emlt 1*( Rdonor " Rdumnel) " I kemit M xlqual " Rlhannel) + (2tT^ L ^)( —— - 

TDA 4 ( Requal ~* 



1 



d 2 f 4( R 2 donor + d 2 )■ 



~ 2 ) 



Y(2n*£*)(-JL—. 4 

TDA 4 R equa i 4 Rdonor 



)] 



In order to find solutions to the above equation, Re qu ai needs to be found by equating 
kemit to the sum of energy transfer of the two acceptors. Solving for Requai in trie below 
equality is done by computer: 

R 6 o 

Td Requal 



I&N D _Nd( Ro j6 + Nd 

hv Td Requai Td ( Rlqual + d* f 



kemit " ktransfer(l) + ktransfer(2) ~ 

The percent change in donor emission from one to two acceptors is tabulated. 



Distance (d) 


Requai 2 acceptos 


E 2 (d) 


Ei(d) 


% change 


SNR 


OA 


39.28 


7.7510x10* 


1.0943x10' 


29.10% 


348.0 


1 


20A 


39.21 


7.7661x10 s 


1.0974x10' 


29.24% 


348.3 


1 


40A 


39.11 


7.8102x10° 


1.1069x10' 


29.44% 


349.3 


1 


60A 


39.01 


7.8815xl0 a 


1.1228x10' 


29.80% 


350.9 


1 


80A 


38.82 


7.9790xl0 6 


1.1450x10' 


30.31% 


353.1 


1 



-104- 

[ 100 A [38.610 18.0962x10* | 1.1737xl0 / | 31.01% | 355.7:1 

%chan g e= E < (d) - E > (d) X 100 
EM 

Tlierefore base resolution down to individual bases can be achieved. A corollary to the base 
resolution argument is that the time spent in the donor layer by an individual acceptor can be 
5 determined. This translates into information about instantaneous rates of DNA movement. 
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In order to achieve optimal linear crossing of a polymer across a channel it is 

15 important to consider the channel diameter as well as the method used to direct the linear 
crossing of the polymer e.g., an electric field. The diameter of the channels should 
correspond well with that of the labeled polymer. The theory for linear crossing is that the 
diameter of the channels correspond well with that of the polymer. For example the ring-like 
sliding clamps of DNA polymerases have internal diameters that correspond well with the 

20 diameter of double-stranded DNA and are successful at achieving linear crossing of a DNA 
molecule. Many kilobases of DNA can be threaded through the sliding clamps. Several 
references also have demonstrated that linear crossing of DNA through channels occurs when 
the diameter of the channels corresponds well with that of the diameter of the DNA. 
(Bustamante, 1991; Gurrieri et al., 1990; Matsumoto et al., 1981). 

25 Single-stranded DNA, as used in the experiment, has a diameter of ~1 .6-nm. A 

channel having an internal diameter of approximately 1.7-3 nm is sufficient to allow linear 
crossing of a single strand DNA molecule. The diameters of the channel and the DNA need 
not match exactly but it is preferred that they be similar. For double- stranded DNA which 
has a diameter of 3.4-nm, channel sizes between 3.5-nm and 4.5-nm are sufficient to allow 

30 linear crossing. 
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As discussed earlier many methods may be used to move the polymer linearly across 
the channel and past the interaction station or signal generation station. A preferred method 
according to the invention utilizes an electric field. An electric field can be used to pull a 
polymer through a channel because the polymer becomes stretched and aligned in the 

5 direction of the applied field as has previously been demonstrated in several studies 
(Bustamante, 1991; Gurrieri et al., 1990; Matsumoto et al., 1981). The most related 
experiments regarding linear crossing of polymers through channels arise from experiments 
in which polymeric molecules are pulled through protein channels with electric fields as 
described in Kasianowicz et al., 1996 and Bezrukov et al., 1994, each of which is hereby 

10 incorporated by reference. A brief description of these experiments is presented below in 
order to illustrate one method for enabling linear crossing of polymers. 

In a study entitled, "Characterization of individual polynucleotide molecules using a 
membrane channel," Kasianowicz et al, 1996 demonstrate the linear crossing of DNA 
molecules through protein channels in a lipid bilayer with an electric field (also described in 

15 PCT Published Patent Application WO 96/29593). An excerpt of the abstract follows. 

We show that an electric field can drive single-stranded RNA and DNA molecules 
through a 2.6-nm diameter ion channel in a lipid bilayer membrane. Because the 
channel diameter can accommodate only a single strand of RNA or DNA, each 
polymer traverses the membrane as an extended chain that partially blocks the 
20 channel. 

The assay is performed using Staphylococcus aureus -hemolysin as the membrane 
channel. It has a diameter of 2.6-nm and can remain open for extended periods of time, 
allowing continuous ionic current to flow across a lipid bilayer. The hypothesis is that while 

25 a DNA molecule traverses a channel, there should be blockage in the ionic flow. Using 
single channel recordings, the blockage time is recorded. The length of blockage time 
corresponds to the length of the single-stranded DNA molecule passing through the channel 
if linear crossing occurs. Initially a potential of -120 mV was applied across the membrane, 
causing a current flow. When single-stranded DNA was added, two consecutive conductance 

30 blockades of 300 us and 1300 us, respectively occurred. 

The data supports two hypotheses: 1) the length of blockage time (denoted as 
"lifetime") is directly proportional to the length of the DNA and 2) greater applied voltage 
shortens the blockage time for a given length of DNA., indicating that linear crossing indeed 
occurred. As a control, it was demonstrated that double-stranded DNA do not cross the 

35 protein channels. 
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Bezrukov et al., 1994 have done similar studies with alamethicin pores in a paper 
entitled, "Counting polymers moving through a single ion channel." Alamethicin pores have 
internal diameters of 2 nm. The duration of the conductance blocks were proportional to the 
length of the polymer pulled through the pore, supporting the hypothesis for linear crossing. 

5 The results from Kasianowicz et al., 1996 and Bezrukov et al., 1994 demonstrate that an 
electric field can drive DNA across a protein pore channel in a linear fashion. 

An experiment was performed to demonstrate according to the invention that DNA 
can pass though fabricated nanochannels. The results of the experiment are shown in Figure 
1 1 . DNA in various forms was exposed to nanochannels to determine if it could pass 

10 through. Only double stranded linear DNA of 50 kb could pass through an array of 4 nm 
nanochannels. Folded DNA as modeled by a circular plasmid of the same size, cannot pass 
through the array. Lanes 1 and 2 are the controls. Circular DNA migrates slower than linear 
DNA as expected. Lanes 3 and 4 demonstrate that only linear DNA can pass through the 
nanochannel array. Since folded DNA which has a solvation diameter of approximately 5 

15 nm, cannot pass through the array, the only means by which the linear DNA passes through 
the plate is in a linear fashion. 

Using the system described above by Kasianowicz et al., a randomly labeled polymer, 
such as DNA can be analyzed by passing the polymer through the channel and making unit 
specific measurements. A channel can be prepared as a protein pore in a lipid membrane 

20 such as that described in Kasianowicz et al., and also in PCT published patent application 
WO/96/29593. Briefly, the S. aureus protein a-hemolysin is added to the cys side of a lipid 
bilayer. Lipid bilayers can be formed from, for example, diphytanoyl phosphatidylcholine by 
layering the solution on 0.2 mm holes in a Teflon film separating two compartments 
containing buffer solution. After the hemolysin is added, voltage can be applied across the 

25 bilayer and varied from 0 mV to 140 mV. DNA which is randomly labeled is added to the 
buffer on the cys side of the protein. A voltage is applied which causes the labeled DNA to 
traverse from this cys to the trans side of the channel, which has a positive charge. As each 
unit passes through the channel, a change in conductance as a result of the blockage of the 
channel occurs. The change in conductance is dependent upon the size, shape and charge of 

30 the unit passing through the channel. If the unit is labeled, the conductance change will 

reflect the properties of the label. In this manner, labeled unit can be identified. This method 
can be used to identify a particular unit or to identify specific order of units or distance 
between units or simply the number of units which are labeled. 
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Retrograde movement of the DNA is unlikely because of differences in frictional 
coefficients for inside the nanochannel and outside. The predicted van der Walls interaction 
between the solvated labeled DNA and the inside of the nanochannel creates a higher 
frictional resistance for the portion of the DNA located inside the nanochannel than that 
5 located in free solution. This is evidenced by the slower migration of DNA in the presence of 
a nanochannel plate (compare lanes 1 and 3 in figure 6). The differential in frictional 
resistance is likely to create a ratchet mechanism in the direct of the desired DNA movement. 

Another method for moving a polymer linearly past an interaction station or a signal 
generation station involves the use of a molecular motor. A molecular motor is a device 

10 which physically interacts with the polymer and pulls the polymer past the station. Molecular 
motors include but are not limited to DNA polymerases and helicases. DNA polymerases 
have been demonstrated to function as efficient molecular motors. Preferably the internal 
diameters of the regions of the polymerase which clamp onto the DNA is similar to that of 
double stranded DNA. Furthermore, large amounts of DNA can be able to be threaded 

15 through the clamp in a linear fashion. 

The overall structure of the p-subunit of DNA polymerase III holoenzyme is 80 A in 
diameter with an internal diameter of -35 A. In comparison, a full turn of duplex B-form 
DNA is ~34 A. The beta subunit fits around the DNA, in a mechanism referred to as a 
sliding clamp mechanism, to mediate the processive motion of the holoenzyme during DNA 

20 replication. It is well understood that the b-subunit encircles DNA during replication to 
confer processivity to the holoenzyme (Bloom et al., 1996; Fu et al., 1996; Griep, 1995; 
Herendeen and Kelly, 1996; Naktinis et al., 1996; Paz-Elizur et al., 1996; Skaliter et al., 
1 996). Because the sliding clamp is the mechanism of processivity for a polymerase, it 
necessarily means that large amounts of DNA are threaded through the clamp in a linear 

25 fashion. Several kilobases are threaded through the clamp at one time (Kornberg and Baker, 
1991). 

Methods for preparing the wall material of the invention are also encompassed by the 
invention. One method for preparing a wall material according to the invention includes the 
step of covalently bonding the agent to a plurality of discrete locations of a wall material. 
30 The agent is bonded to discrete locations on the wall material which are close enough to an 
interaction station, such that when an individual unit of a polymer, which is interactive with 
the agent to produce a signal, is positioned at the interaction station, the agent interacts with 
the individual unit to produce the signal. The discrete locations may be on the surface of the 
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wall material or may be within the wall material such that the agent is embedded in the wall 
material. 

Another method is for attaching a chemical substance selectively at a rim of a channel 
through a wall material that is opaque. An "opaque" material as used herein is a material 
5 which is light impermeable at a selected wavelength. 

The wall material is provided with photoprotective chemical groups attached at the 
rim of the channel through the wall material. Light is then applied to the photoprotective 
chemical groups to dephotoprotect the chemical groups, and a chemical substance is attached 
to the dephotoprotected chemical groups. 
10 A "photoprotective chemical compound" as used herein is a light sensitive compound 

which is capable of becoming chemically reactive when exposed to light. When light is 
applied to the photoprotective chemical groups the groups become dephotoprotected and are 
susceptible to interactions with chemical substances such as light emitting compounds and 
quenching compounds. 

15 Localized regions of agent can still be prepared even if the photoprotective chemical 

group covers the entire surface of the wall. The light may selectively be applied to regions of 
the wall on which it is desirable to have the agent localized. For instance the light may be 
applied only to the region of the wall surrounding the channel openings so that only those 
regions are dephotoprotected. When the agent is added it will only attach to the wall around 

20 the regions surrounding the channel openings. 

Additionally a method for preparing a wall material having localized areas of light 
emission on a surface of the wall material is provided. A light emissive compound is applied 
to the surface to produce at least localized areas of light emission on the surface. The light 
emissive compound may be added directly to the surface of the wall material or it may be 

25 attached indirectly to the surface through a photoprotective chemical group which has been 
attached to the surface and dephotoprotected by light. 

A "localized area of light emission" as used herein is a region of concentrated light 
emissive compound on the surface of the wall material which defines a target region around a 
rim of a channel through the wall material for detecting light emission. The localized area 

30 can be produced in several ways. Firstly, the light emissive compound may be attached 

directly to the wall surface only around the rim of the channel. Secondly, the light emissive 
compound may be attached to a photoprotective chemical group which has been attached 
selectively to localized areas around the rim of the channels. Alternatively the 
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photoprotective chemical groups may be added to the entire surface or to random regions of 
the surface of the wall material but only select regions are dephotoprotected by light to create 
localized regions to which the light emissive compound can be attached. Both the 
photoprotective chemical groups and the light emissive compound can be added to the entire 

5 surface or to random regions of the surface of the wall material when a mask is used to create 
localized regions of light emission. A mask having openings may be positioned over the wall 
surface such that the openings in the mask expose localized regions of the wall surface 
specifically around the openings of the channels. 

A wall material having localized areas of light emission on a surface of the wall 

10 material may also be prepared by first applying a light emissive compound to the surface to 
produce at least localized areas of light emission on the surface and then by creating a 
channel in the wall material wherein a rim of the channel forms a target region within the 
localized areas of light emission. 

An non-limiting example of a method for constructing an article of manufacture 

15 having a concentric ring of donor fluorophores around each channel is provided to 

demonstrate a preferred method of the invention. The method is achieved by the use of 
photolabile protecting groups and light diffraction. A light impermeable polycarbonate 
porous medium is coated on one side with a dense layer of covalent linkers. The linkers are 
protected by photolabile protecting groups. Light is transmitted through the back side of the 

20 light impermeable polycarbonate. Since the wavelength of the deprotecting light source 
(400-500 ran) is much greater than the size of the channels (5 ran), each channel acts as a 
point source of light. The intensity of transmitted light is greatest closest to the channel. 
Accordingly, only those groups close to the channels are deprotected. Deprotected groups 
are free to react with donor fluorophores in subsequent chemical reactions to generate 

25 concentric rings. 

The radius of the concentric ring of deprotection is controlled through understanding 
the diffractive nature of light. The long wavelength of light and the small size of the channels 
set up a situation of very strong diffraction where diffracted angles are greater than 90°. 
Light does not pass through the light impermeable polycarbonate surface, but rather is forced 

30 to undergo diffraction through the channels. According to Huy gen's principle, each channel 
acts as a secondary point source of light, spherically radiating. Huygen's principle states that 
all points on a wavefront can be considered as point sources for the production of spherical 
secondary wavelets. After a time t the new position of the wavefront will be that surface of 
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tangency to these secondary wavelets. A direct consequence of Huygen's principle is the 
generation of secondary point sources at the exit ends of the channels. A further consequence 
is that the intensity of excitation light decreases with increasing distance from the center of 
each channel. 

The amount of light exiting the channel and the resulting spherical distribution of 
light intensity can be calculated. The power of light exiting the channel is given as the cross- 
sectional area of the channel multiplied by the intensity of the incident light (I 0 ). 

P ~ Io^ Rchannel 

Upon exit of the channel, the light becomes a spherically radiating point source. In this 
particular case, the light source is restricted to radiate to half a sphere (a surface area of 4 tc 
r 2 /2). The new intensities (I(r)) as a function of the radial distance from the center of the 
channel is given as the power of light exiting the channels divided by the surface area of 
radiation at a given distance (r). 

j P Jo Rchannel 

~7 2r 

For a 60 W light source 1 cm away from the back side of the light impermeable 
polycarbonate, the intensity decreases from 4.77 W/cm 2 to 0.43 W/cm 2 at a distance 35 nm 
(Requai) from the center of the channel, corresponding to a 91% intensity change. From the 
above equation, the radius of light around each channel can be precisely controlled by the 
intensity of the light source. 

Photosensitive protecting groups have been described in great detail (Pillai, 1980). 
Depending on the type of covalent linker (amine, hydroxyl, carboxylic acid, ketone, 
sulfhydryl, etc.), the corresponding photosensitive protecting group is available. For 
example, amino groups can be protected with nitroveratryloxycarbonyl (NVOC), 2- 
nitrobenzyloxycarbonyl, and a-substituted 2-nitrobenzyloxycarbonyl groups. The latter two 
can also be used in the protection of carboxylic acids and hydroxyl groups. Photolytic 
deblocking of 2-nitrobenzyloxycarbonyl derivatives is straightforward, requiring only a 350 
nm lamp and ethanol. The time of deprotection is controlled from 1 to 24 hours. 

The invention also encompasses an apparatus for detecting a signal. The apparatus 
provides a support for the article of manufacture and a sensor for detecting the signals 
generated by the interaction which occurs as the polymer traverses the interaction station. 
The apparatus includes a housing with a buffer chamber, a wall defining a portion of the 
buffer chamber, and having a plurality of openings for aligning polymers, and a sensor fixed 
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relative to the housing, the sensor distinguishing the signals emitted at each opening from the 
signals emitted at the other of the openings to generate opening dependent sensor signals. 

An "opening dependent sensor signal" is a signal which arises at an opening in the 
wall material as a result of an interaction between a polymer and the station. 

5 The wall within the housing defines at least one and preferably two buffer chambers. 

A "buffer chamber" as used herein is area which is capable of supporting a liquid medium. 
The two buffer chambers may be in fluid communications with one another. 

The wall has a plurality of openings formed by channels within the wall. A "plurality 
of openings" within the wall means at least two openings formed by at least two channels. 

10 Preferably a plurality of openings is at least 50 openings. 

A "sensor" as used herein is a device which responds to a physical stimulus and 
transmits a resulting impulse in the form of a signal. Sensors include but are not limited to 
optical sensors, temperature sensors, pressure sensors, auditory sensors, magnetic sensors, 
electrical, mechanical, radioactive and motion sensors. Preferably the sensor is an optical 

15 sensor. As used herein an "optical sensor" is a device which detects and converts input 
electromagnetic radiation signals an impulse. The impulse can be measured and stored as 
data. Optical sensors are well known in the art and include microscopes. The sensor is fixed 
relative to the housing such that the sensor is capable of detecting signals generated at the 
interaction station. It is not necessary that the sensor be secured or attached directly to the 

20 housing. 

A microprocessor as used herein is a device for collecting and storing sensor signals. 
In general a microprocessor is a chip containing several electronic components such as ROM, 
RAM, registers, and I/O controls. Conventional microprocessors are well known in the 
electronics arts. 

25 An example of an apparatus constructed to hold a nanochannel (or microchannel) 

plate (120) which is capable of generating an electric field is presented in Figure 12. The 
electric field, created by electrodes (128, 130), is used to draw the DNA through the 
nanochannels. The exemplary nanochannel plate is immersed in a slightly viscous buffer 
solution which helps to slow the transit of the polymer through the nanochannel, so as to 

30 allow for a longer signal collection time per base. In addition, on either side of the plate are 
electrodes (128, 130) immersed in the buffer solution. The ensemble of nanochannel plate, 
buffer compartments (122), and electrodes are contained in an optical quality glass chamber 
suitable for image analysis and are positioned adjacent to a 60x1 .4NA oil objective (126). 
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As discussed above the use of an electric field to cause the polymer to move linearly 
through a channel is preferred. The use of an electric field is suitable because the stretched, 
linear orientation of a polymer in an electric field is favorable for linear crossing of 
nanochannels. Furthermore, the rate of polymer movement can be controlled by voltage. 

5 Lastly, an electric field does not adversely affect FRET. 

Light microscopy (Bustamante, 1991; Gurrieri et al., 1990; Matsumoto et al., 1981, 
Rampino and Chrambach, 1990; Schwartz and Koval, 1989; Smith et al., 1989), linear 
dichroism (LD) (Akerman et al., 1990; Akerman et al., 1985, Moore et al., 1986), 
fluorescence-detected LD (Holzwarth et al., 1987), and linear birefringence (Sturm and 

10 Weill, 1989; Chu et al, 1990) can be used to study the instantaneous changes in shape of 
DNA molecules undergoing gel electrophoresis. In these studies DNA is shown to be 
strongly oriented and stretched. 

Guirrieri et al., 1990 has demonstrated linear and stretched conformation of DNA 
molecules in an electric field. In each of the cases, the DNA molecule is clearly aligned in 

15 the direction of the applied electric field. The method used to visualize the DNA molecules 
combines fluorescent DNA labeling and use of an image intensifier- video camera system 
(Bustamante, 1991; Houseal et al., 1989; Morikawa and Yanagida, 1981; Matsumoto et al., 
1989; Yanagida et al., 1983). The DNA molecules shown are T2 molecules which are 164 
kbp long. 

20 The orientation of DNA in an electric field has been well studied with linear 

dichroism and electric dichroism (Ding et al., 1972; Yamaoka and Charney, 1973; Colson et 
al., 1974; Hogan et al., 1978; Priore and Allen, 1979; Yamaoka and Matsuda, 1981; Wu et 
al., 1981). In fact, the first studies done on DNA orientation have been performed with these 
two techniques. DNA was first studied in solution and then subsequently in electrophoretic 

25 gels. Studies both in solution and in gels yield similar results in that the DNA molecules are 
indeed oriented in the direction of the electric field. 

Native DNA exhibits a negative UV linear dichroism (LD) as a result of the 
preferential orientation of the nucleotide bases nearly perpendicular to the field. The 
orientation of DNA has been attributed to the presence of either a permanent or an induced 

30 dipole moment. For example, Ding et al., 1 972 describe their observation of DNA in a Tris- 
cacodylate buffer diluted in ethanol to give 80% v/v ethanol. 
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LD data from single electric pulse of 9 V/cm. Duration of pulse is indicated by the 
horizontal bar. At steady-state, the plateau of the LD r reads close to -1.5, which means that 
the DNA is oriented in the direction of the electric field (Akerman et al., 1990). 

Akerman et al., 1990 have performed LD studies on pulsed-field gel electrophoresis 
5 and similar results are obtained as in solution. Akerman provides a plot of LD r versus time. 
LD r represents orientation in the electric field. A value of -1.5 means that the DNA molecule 
is oriented in the electric field. A horizontal bar on the x-axis denotes the duration of the 
electric pulse. The DNA is in 1% agarose. A short time after the beginning of the pulse, the 
curve plateaus at a LD r close to -1 .5, meaning that the DNA is oriented a short time after the 
10 beginning of the pulse. When the pulse is turned off, the DNA no longer orients itself in the 
field and the LD r curve no longer remains at -1 .5. 

The rationale for DNA orientation in electric fields is based on either a permanent or 
induced dipole moment in DNA. Recall that dipoles align themselves in the direction of 
electric fields to minimize torque. Hogan et al., 1978 have proposed a rationale for the 
15 induction of a dipole moment in DNA that relates to anisotropic ion flow: 

In order to explain the dependence of the dichroism on the electric field, the ionic strength 
of the medium, and the length of the macromolecule, we propose a new model in which 
anisotropic ion flow produces an asymmetric ion atmosphere around the polyelectrolyte, 
resulting in an orienting torque. 
20 The conclusion from the discussion on DNA alignment in electric fields is that DNA 

molecules and other polymers align themselves in the direction of electric fields whether in 
an electrophoretic gel or in solution. Solution studies were performed before electrophoretic 
studies. The implications of DNA alignment in an electric field further support the fact that 
DNA molecules and other polymers can be driven across nanochannels in a linear fashion. 
25 The unfolded orientation of the DNA is also important in the linear crossing of DNA 

molecules. As described above DNA and other polymeric molecules can be driven through 
protein channels in a linear fashion with an electric field. The data described herein 
rationalize this further and give a theoretical basis to why DNA molecules can behave in such 
a fashion. Given that DNA has been shown to pass through protein pores in a linear fashion 
30 and that DNA molecules align themselves in the direction of electric fields, linear crossing of 
DNA through nanochannels can be achieved. 

The rate at which a polymer moves is also important because the duration of energy 
transfer is important. The longer an acceptor remains in the donor fluorophore layer, the 
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greater the signal generated. Millisecond integration times allow for unequivocal signal 
detection. Since the layer of .donor fluorophore is roughly 40 A, the rate of polymer 
movement that is needed can be approximated as 40 A/10 ms, or 4000 A/s. In nanochannel 
FRET sequencing, the proposed rate is achieved by controlling either the voltage of the 
5 applied electric field or the frictional coefficient of the polymer molecule. 

The rate of DNA movement can be determined given the equation that defines the 
movement of a DNA strand in an electric field (Tinoco et al., 1995): 

ZeE 

where Z is the (unitless) numbers of charges, e = 1. 6022x1 0" 19 coulombs, E is the electric 

10 field in colts per m, and /is the frictional coefficient in kg/s. The velocity of motion this 

depends on the magnitude of the electric field E, the net charge of the molecule, and the size 
and shape of the molecule as characterized by its frictional coefficient,/ The net charge on 
the molecule is designated by Ze. The frictional coefficient can be determined from the 
following equation (Tinoco et al., 1995): 

15 f = kf o = k(6xT]r) 

where h is the viscosity coefficient, r is the radius for a sphere having the same volume as 
the DNA, k is a shape factor (which for a strand of DNA is about 1 .7), and/, is the frictional 
coefficient for a sphere having the same volume as the DNA. 

Rationally, it can be seen that either the frictional coefficient or the magnitude of the 

20 electric field can be used to control the rate of polymer movement. For different voltages in a 
given system, there are different polymer migration rates. In a similar fashion, larger 
molecules or molecules in more viscous media have slower molecular mobility. From 
examples in gel electrophoresis, calculations of the effects of electric field strength, polymer 
size, and frictional coefficient are possible. It is important to understand a viscous medium 

25 can achieve the same desired effects on the frictional coefficient as a gel. Since the frictional 
coefficient is inversely proportional to the electrophoretic rate, doubling viscosity decreases 
electrophoretic rate by two-fold. 

Electrophoresis in viscous solvents has been performed by a number of groups 
(Chang and Yeung, 1995; Bello et al., 1994; Jumppanen and Riekkoloa, 1995; Sahota and 

30 Khaledi, 1994; Klenchin et al., 1991 ; Harris and Kell, 1985; Korchemnaya et al., 1978). 

Sahota and Khaledi, 1994 demonstrate electrophoresis in formamide, which has a viscosity of 
3.3 cP at 25°C, triple that of water (0.89 cP, 25°C). Electrophoresis in formamide is more 



-115- 

favorable than in water because formamide has a higher dielectric constant than water and 
can solubilize many supporting electrolytes. It is commonly used in mixtures with water in 
electrophoretic buffers. Mixed glycerol solutions, N-methyl-2-pyrrolidone, acetic acid, and 
iV-methyl formamide have also been used (Korchemnaya et al., 1978). A large range of 

5 viscosities are possible, ranging from 1 cP to as high as 25 cP (Bellow et al., 1994). The 
actual viscosity of the solution needed depends on the frictional coefficient of polymer 
molecules in nanochannels. The net frictional coefficient thus depends on viscosity and also 
the frictional effects of the nanochannels on the polymer. From the small bore of the 
nanochannel, there are frictional effects from solvent trapping between the walls of the 

10 nanochannels and the labeled polymer. The magnitude of these solvent trapping effects is 
similar to those arising from gel electrophoresis. As a result, only a slightly viscous solution 
may enhance the desired rate of polymer movement. 

For a given electric field, the electrophoretic mobility for different sized molecules 
can be determined by knowing the relationship between molecular weight and distance 

15 migrated, given as: 

log M = a - bx 

where Mis the molecular weight of the nucleic acid and x is the distance migrated 
(proportional to mobility), a and b are constants for a given electric field. 

To summarize, the rate of polymer movement in an electric field can be controlled by 

20 the electric field, frictional coefficient, and molecular size. Desired values for the rate of 
polymer movement are readily achievable. 

There are no adverse effects of an electric field on FRET. Since energy transfer is 
related to short range electrical dipole interactions, the presence of an outside electric field 
may negatively affect the ability of donor-acceptor pairs to undergo FRET. The presence of 

25 an external electric field has no adverse effects on FRET as seen from recent studies 
performed with FRET primers and gel-based automated DNA sequencing (Glazer and 
Mathies, 1997; Hung et al, 1996; Ju et al, 1996a; Ju et al, 1996b; Marra et al., 1996; Wang 
et al., 1996; Wang et al., 1995). FRET primers are used in the labeling of sequencing 
reaction products because of the large Stake's shift and thus greater fluorophore 

30 discrimination. In these experiments, the FRET labeled DNA fragments are run on a gel and 
concurrently, in the presence of the electric field, the fragments are detected with a CCD 
camera and laser. Either a four-color capillary electrophoresis system or Applied Biosystems 
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373A sequencer is used. These experiments demonstrate that energy transfer occurs in the 

presence of a strong electric field. 

An exemplary design of a nanochannel apparatus is shown in Figure 13, which 

consists of two fused Pyrex cells (132, 134) that hold the nanochannel plate (140). The upper 
5 (136) and lower (138) buffer regions are contained in the smaller (134) and larger (132) 

Pyrex cells, respectively. The buffer regions are sealed off with glass coverslips (146) that 

have been modified with indium tin oxide (ITO). Indium tin oxide is a clear conductor which 

functions in this system as electrodes (142). The electrodes lead to a variable voltage supply 

(152) consisting of a load resistance R L (one to several G , to approach a constant current 
10 system) and a rheostat-controlled voltage from a voltage source. A voltmeter is connected in 

parallel. Under the glass coverslip an oil (144) coated immersion objective (150) is 

positioned to detect fluorescence signals (148). 

The polymer is loaded into the upper buffer region and the electric field is used to 

drive the polymer to the lower chamber. The buffer has the desired viscosity and electrolytic 
15 properties. Epiillumination and signal collection are possible through the same 60x, 1 .4 NA 

oil immersion objective. The dimensions of the apparatus are similar to that of a 

conventional microscope slide (75 mm x 25 mm) for ease of mounting. The gap between the 

nanochannel plate and the glass coverslip is 0.4 mm. A thin coverslip with a thickness of 0. 1 

mm is shown. These dimensions are preferred because long working distance objectives are 
20 available. For instance, the CFI60 system from Nikon has 60x, 1 .4 NA objectives that can 

examine specimens over 0.6 mm thick. 

A temperature control block is used to prevent Joule heating and thermal fluctuations. 

The block can surround the nanochannel apparatus and has inlet/outlet ports for coolant flow. 

The block is hollow to allow uniform temperature control of the nanochannel apparatus. 
25 There are inlet/outlet ports on either side of the apparatus that allow the recycling of the 

coolant through the hollow chamber. 

Several groups have used specialized apparatuses for fluorescence microscopy 

observations of electrophoresis (Rampino and Chrambach, 1991; Matsumoto et al., 1981; 

Smith et al., 1989; Bustamante, 1991; Gurrieri et al., 1990; Schwartz and Koval, 1989; Smith 
30 et al., 1990). The apparatus used by Smith et al., 1990 for the observation of DNA molecules 

undergoing pulsed-field gel electrophoresis is described: 

A 20-ul drop of molten agarose was placed between two microscope coverslips (24 mm x 
24 mm) and allowed to cool for 10 minutes. This sandwich arrangement was placed on 
top of a regular microscope slide and sealed at the four corners with fingernail polish. 
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The slide had been previously prepared with four copper electrodes fixed on it with epoxy 
adhesive. This apparatus was then refrigerated at about 5°C for 30 minutes. Molten 
agarose was dropped over the electrodes to complete the electrical connection, and TBE 
buffer (0.5x) was occasionally added to keep it wet. 

5 

Rampino and Chrambach, 1991 describe a more complex apparatus that has larger 
buffer regions, platinum strip electrodes, and a region for coolant flow. The entire apparatus 
is mountable on the microscope stage. The actual gel itself is sandwiched between two glass 
coverslips and placed in the center of the apparatus, in contact with buffer blocks on either 

10 side. The design allows for a homogenous electric field and temperature control. In contrast 
to the apparatus described above, the Rampino and Chrambach apparatus is designed for 
standard and not pulsed-field gel electrophoresis. 

Briefly, a description of how the apparatus (shown in Figure 13) of the invention 
functions in relation to polymer is described below. A polymer such as DNA is labeled 

15 (intrinsically or extrinsically) by the methods of the invention or any other method known in 
the art. The labeled DNA is placed in a buffer solution, which is preferably slightly viscous. 
The buffer is added to the upper buffer chamber 136. An electric field is created using the 
electrodes and the DNA is caused to enter the nanochannels of the nanochannel plate (140) in 
a linear manner. As the labeled DNA emerges from the other side of the nanochannel plate 

20 the label is caused to interact with an agent or the environment surrounding the channel to 
produce a signal or polymer dependent impulse which is detected by the detection device. 
The detected signal or polymer dependent impulse is stored and processed to reveal structural 
information about the polymer. 

A final aspect of the invention encompasses a method for preparing nucleic acids for 

25 use according to the methods of the invention as well as for any other use in which it is 
desirable to utilize randomly labeled nucleic acids. The method involves the steps of 
contacting a dividing cell with a nucleotide analog, isolating from the cell nucleic acids that 
have incorporated the nucleotide analog, and modifying the nucleic acid with incorporated 
nucleotide analog by labeling the incorporated nucleotide analog. A "nucleotide analog" as 

30 used herein is a molecule which can be substituted for A, T, G or C but which has a modified 
structure. Nucleotide analogs include for example but are not limited to a brominated analog, 
2,4-dithiouracil, 2,4-Diselenouracil, hypoxanthine, mercaptopurine, 2-aminopurine, and 
selenopurine. The incorporated nucleotide analog may be labeled with an agent selected 
from the group consisting of a light emitting compound, a quenching source and a 

35 fluorescence excitation source. 
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A dividing cell is "contacted" with a nucleotide analog by any means known in the art 
for incorporating nucleotides into living cells. For instance the nucleotide analog may be 
added to the cell culture medium and taken up by the cell naturally. In order to optimize the 
uptake of the nucleotide analog the dividing cell may be growth arrested using conventional 
means prior to adding the nucleotide analog to the medium and then the arrest removed to 
allow the cells to reenter the cell division cycle once the nucleotide analog is added to the 
medium. The nucleic acids having the incorporated nucleotide analog may then be isolated 
after the cells have reentered and completed at least one cell division cycle. 

An example of a method for incorporating a nucleotide analog into DNA is provided 
in Bick and Davidson, Proc. Nat. Acad. Set, 71 :2082-2086 (1974). Bick and Davidson grew 
the BrdU-dependent cell line, B4, in a basic growth medium of Dulbecco's modified Eagle's 
medium supplemented with 10% fetal-calf serum (E medium) containing 0.1 mM BrdU. 
100-mm Falcon plastic tissue culture dishes were inoculated with 10 6 B4 cells in E medium 
containing 0.1 mM hypoxanthine, 0.4 uM aminopterin, and 10 uM BrdU (E-HAB medium) 
and the cells were passaged three times at high density (5 X 10 5 cells per 100-mm dish) in E- 
HAB medium over six weeks. The cells were then plated at low density (1000 cells per 6- 
mm dish) in E-HAB medium and three weeks later, approximately 10 large colonies were 
observed in each dish. The cells from one dish were harvested and maintained as a new cell 
line called HAB. Because of the expected photosensitivity of BrdU-containing cells, the 
HAB cells were at all times protected from environmental lighting. After 100 cell 
generations in E-HAB medium, an aliquot of HAB cells was transferred back to E medium. 
A new subline of cells, called HAB-E, was isolated and maintained in E medium. 

Bick and Davidson grew the cell in the media (described above) to which was added 
H 3 32 P0 4 (New England Nuclear Corp.) at an activity of 2 uCi/ml. After 2-3 population 
doublings the cells were harvested and DNA was isolated as described in Davidson and Bick 
{Proc. Nat. Acad. Set, 70:138-142 (1973)) except that after the first phenol extraction, DNA 
preparations were routinely treated with RNase A at a concentration of 50 ug/ml for 60 min 
at 37°. After a second phenol extraction, the DNA was extensively dialyzed against 10 mM 
Tris-HCl, 10 mM pH 7.6, 1 mM EDTA, and then finally dialyzed against 10 mM Tris-HCl 
pH 7.6, 1 mM EDTA. Other methods for isolating DNA are well known in the art. See e.g., 
Sambrook. et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor 
Laboratory Press, 1989). 
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The method of the invention for preparing nucleic acids is referred to hereinafter as in 
vitro base specific labeling (IBSA). IBSA involves the culture of cells derived from a subject 
in the presence of nucleotide analogs. The nucleotide analogs are either capable of 
subsequent chemical modifications, have a high molar extinction coefficient in the donor 

5 emission wavelengths, or have linkers for the attachment of acceptor labels. The most 

important analogs are the ones which contain functional groups that do not interfere with base 
pairing during DNA replication. The 8-substituted purines, 5 -substituted pyrimidines, and 6- 
substituted pyrimidines satisfy this criteria. 

A non-limiting example of an IBSA scheme is set forth in detail below. The method 

10 is outlined in Figure 14 and covers the steps from initial DNA isolation to genomic labeling 
and to final sequence analysis. Cells are isolated from the subject and then grown. At a time 
when there are enough cells to provide adequate genetic material, the cells are arrested in the 
cell cycle. The arrest can occur at any point in the cycle as long as it does not interrupt the S 
phase. Nucleotide analogs that can be easily chemically derivitized, such as 8-bromo-2'- 

15 deoxypurines and 5-bromo-2'-deoxypyrimidines, are added to the growth medium. The cells 
are then grown for one cycle and the genomic DNA from the cells are isolated. The resulting 
DNA is a chimera of labeled and unlabeled DNA, a product of semiconservative replication. 
The advantage in having the DNA with only one strand labeled is so that the resultant duplex 
DNA can be used in a sequencing method of the invention. 

20 The genomic DNA is broken down into sizes of approximately 100 kb. The genomic 

DNA is chemically labeled and end-labeled at this point. The advantage of culturing cells in 
brominated analogs is that such a modification activates the bases for further modification, 
the addition of acceptor labels to the bases. For a human genome, there are approximately 
thirty thousand fragments. The fragments are analyzed simultaneously by the sequencing 

25 methods of the invention. Fragments of similar size and potential sequences are grouped 
together and population analyses are performed to generate DNA sequence information. 
Sequencing of a genome can be completed in approximately six hours. The time from 
isolation of a subject's cell to the end of sequencing is approximately one week. The 
following paragraphs describe each individual step in detail. 

30 The first step to the outlined scheme is cell culture and cell cycle arrest. The purpose 

of the cell cycle arrest is to generate a synchronous population of cells which can undergo 
DNA replication at the same time to ensure all the resultant DNA will be a chimera of labeled 
and unlabeled DNA. There are many methods to synchronize cells including methods which 
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arrest cells at either M or S phase. Only an M phase block would produce the desired 
genomic chimeras. Examples of metaphase inhibitors include nitrous oxide under pressure, 
vinblastine (Marcus & Robbins, 1963), and colcemid. Colcemid blocks cells in metaphase 
but can be reversed by washing (Mitchison, 1971; Kato & Yoshida, 1970; Stubblefield, 

5 1 968). Cells are grown for a few hours in colcemid, resulting in a synchronous culture 
(Wunderlich & Peyk, 1969). Nitrous oxide under pressure (Rao, 1968) is advantageous in 
that it can be easily removed. Removal of the block and addition of the desired brominated 
base analog allows for a synchronous entrance into S phase. 

The nucleotide analogs which can be used include but are not limited to 8-bromo-2'- 

10 deoxyadenosine, 8-bromo-2'-deoxyguanosine, 5-bromo-2'-deoxycytidine, and 5-bromo-2'- 
deoxyuridine. These brominated nucleotide analogs are activated for coupling to 
nucleophiles (Traincard et al., 1983; Sakamoto et al, 1987; Keller et al., 1988; Hermanson, 
1996). Coupling of amine-containing acceptor labels and fluorophores to the base analogs is 
done at ambient temperatures (35°C). The final positions of the acceptor labels are such that 

15 the base-pairing of the labeled fragment is not affected. 

Brominated nucleotide analogs are commercially available (Sigma Corp., Fluka Inc., 
Fisher Scientific, Inc.) or can be prepared by reaction with brominating reagents. 
Hermanson, 1996 outlines a protocol for bromination of DNA at thymines, cytidines, and 
guanosines using N-bromosuccinimide (NBS). Adenine residues are prepared from other 

20 methods which do not require NBS. For an example, 8 -bromo-2' -deoxyadenosine and 8- 
bromo-2'-deoxyguanosine can be synthesized by reaction of the native base with bromine 
water (Ikehara et al., 1969a; Ueda et al., 1974; Faerber & Scheit, 1971; Kochetkov et al., 
1968). Bromination at the eighth position of the purines can also achieved via the nucleotide 
morpholidate (Lang et al., 1968; Ikehara et al., 1969b). The solvent condition for 

25 bromination is especially important. Bromination of 2'-deoxyadenosine does not take place 
in dimethylformamide at 0°C, but if the reaction is conducted at 50-60°C in glacial acetic 
acid, 8-bromo-2'-deoxyadenosine is formed relatively easily (Gueron et al., 1967; Rahn et al., 
1965). 

Brominated and other base analogs have been demonstrated to be efficiently 
30 incorporated into genomic DNA during cell culture (Balzarini et al., 1984). A particular 

example is the incorporation of 5-bromo-2'-deoxyuridine into cells used for flow cytometry. 
5-bromo-2'-deoxyuridine is incorporated into newly synthesized DNA in the place of 
thymidine (Crissman et al., 1990; Poot & Hoehn, 1990; Bohmer, 1990; Gaines et al, 1996; 
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Nicolas et al., 1990). The percentage of substitution of base analog can be very high as 
according to Bick and Davidson, 1974 in their paper discussed above and titled, "Total 
Substitution of Bromodeoxyuridine for Thymidine in the DNA of a Bromodeoxyuridine- 
Dependent Cell Line. The following is a paragraph from this paper: 

To obtain total substitution, BrdU-dependent cells were exoposed to culture 
medium containing aminopterin, a powerful inhibitor of thymidine 
biosynthesis, and BrdU in the absence of added thymidine... We report 
here the isolation of a cell line in which at least 99.8% of the thymidine in 
the nuclear DNA has been replaced by BrdU. 

In the current proposed random one-base labeling scheme, percentages as high as 
99.8% substitution are not necessary but such data gives insight into possibilities of cellular 
incorporation of base analogs. Other brominated base analogs can be similarly incorporated 
into mammalian cells with ease (Stewart et al., 1968). A few examples of non-bromine base 
analogs that have been incorporated in this manner include 2-aminopurine (Glickman, 1985), 
5-propynyloxy-2'-deoxyuridine, and 5-ethynyl-2'-deoxyuridine (Balzarini et al., 1984). 

After generation of the appropriately brominated DNA chimeras through cell culture, 

the chimeric genomic DNA is prepared for analysis by performing several steps, including an 

optional step of chromosome sorting. 100 kb DNA fragments can be generated from the 

partial digestion of flow sorted chromosomes. The optional flow sorting step has been 

covered in detail by many manuals and textbooks (de Jong et al., 1989; Fawcett et al., 1994) 

and will not be discussed in detail. In brief, the cells are arrested in metaphase by the 

addition of colcemid. Several million of the cultured cells are gently lysed to release the 

metaphase chromosomes into a suspension buffer in which they are stained with one or more 

fluorescent dyes. The target chromosomes are then identified by their pattern of fluorescence 

emission, using either a single fluorophore in a one laser system, or two fluorophores in a two 

laser detection system. The sorted chromosomes are then prepared and partially digested as 

according to protocols described by Glover and Hames, 1995 in DNA Cloning 3. Sizes on the 

order of 100 kb are obtained. 

Fluorophore labeling of the brominated DNA can be done either before or 
after partial digestion. The base bromination activates the bases for 
nucleophilic displacement. Many different fluorophores with 
amine/hydrazine groups can be used including the popular dyes fluorescein, 
coumarin, eosin, rhodamine, and Texas Red (Molecular Probes, OR). These 
have absorptions and emissions in the visible region. The amine group on 
these fluorophores attack the 8 position of the purines and the 5 position of 
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the pyrimidines, undergoing a nucleophilic displacement reaction. The 
experimental protocol for the displacement reaction is described by 
Hermanson, 1996. A modified outline of the labeling scheme described is 
given below. 

Protocol for labeling Brominated DNA 

1 . Dissolve a hydrazine or amine-coupled fluorophore in water at a concentration of 80- 
100 mM. 

2. Add 25 pi of the fluorophore solution to a bromine-activated DNA solution. 

3. React for 1 h at 50°C. 

4. Purify by ethanol precipitation. Add 20 ul of 4 M LiCl and 500 ul of ethanol (chilled 
to-20°C). Mix well. 

5. Store at -20°C for 30 min, then separate the precipitated DNA by centrifugation at 
12,000 g. 

6. Remove the supernatant and wash the pellet with 70% and 100% ethanol, centrifuging 
after each wash. 

Redissolve the labeled DNA pellet in water and store at -20°C. 

From the genomic partial digestion, approximately thirty thousand 100 kb fragments 
are created. The fragments are end-labeled either with terminal transferase or through 
ligation to short labeled polynucleotides. The fluorescent labels incorporated by end-labeling 
are uniquely identified either by different spectral absorbances or a specific sequence of 
labels (as in the ligation scheme). After end-labeling, the prepared DNA is subject to 
nanochannel FRET sequencing as described above. The data is sorted by fragment size and 
potential sequence. Through population analysis, the DNA sequence is generated. 

Figure 15 presents the general scheme for deciphering DNA sequence from IBS A 
labeling. A sample of genomic DNA has two of its bases labeled (in this case A and C). The 
end-labeled and internally labeled DNA fragments are analyzed by FRET sequencing. The 
resulting data is sorted by fragment size and potential sequence. Potential sequence is 
defined as the pattern of sequence-specific FRET signals generated for a particular fragment 
of DNA. A potential sequence does not give sequence information, but rather allows DNA 
molecules to be uniquely identified. Population analyses are performed to determine all the 
positions of the A's and C's on each of the fragments. Complementary strands are paired. 
The process of determining base position is repeated for only one of the bases (A). 
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Comparison of the resulting data for one labeled base and two labeled bases generates the 
DNA sequence. 

Hundred kilobase fragments are not always desired for sequencing. Expression 
mapping and single gene sequencing require much shorter labeled fragments. The generation 
of shorter labeled fragments can be achieved by nick translation, primer extension, and the 
polymerase chain reaction (PCR). In addition, smaller fragments can also be analyzed by a 
modification of the Sanger reaction and the use of terminal transferases. With enzymatic 
methods, large nucleotide analogs can often be incorporated (Hermanson, 1996). 

In primer extension, modified nucleoside triphosphates are added to a DNA template 
using a defined amount of the desired primer. The form of polymerase used is the Klenow 
fragment, which lacks the 5'-3' exonuclease activity of intact E. coli DNA polymerase I 
(Kessler et al., 1990; Feinberg and Vogelstein, 1983, 1984). Equivalent enzymes are the 
Sequenase 2.0 and T7 polymerases. This method is a simple method of preparing internally 
labeled target DNA. Using the method, synthesis of large internally labeled DNA fragments 
with lengths greater than 10 kilobases have been synthesized with one of the nucleotides 
replaced by a modified base. A brief summary of the protocol involves using Sequenase 2.0 
(Amersham). The annealing reaction consists of 1 ul primer (0.5 pmol/ul), 2 ul Sequenase 
buffer (5X), 5 ug plasmid DNA, and 10 ul sterile distilled water. After incubation in a 65°C 
waterbath for 2 minutes, annealing is allowed to occur at 30°C for 30 minutes. The actual 
reaction mixture consists of 10 ul of annealing reaction (primer-template), 1 ul dithiothreitol 
(0.1 M), 1 uM of each dNTP, 0.1 uM of a fluorophore dNTP, and 0.025 U/ul of Sequenase. 
After mixing, the reaction is allowed to proceed at room temperature for 15 minutes. 

Using primer extension, Ambrose et al. (1993) and Harding and Keller (1992) have 
demonstrated the synthesis of large fluorescent DNA molecules. In their experiments, 
complete replacement of one pyrrolidine with a fluorophore analog was achieved in DNA 
longer than 7 kb. Fluorescent DNA between 5 to 7 kb in length, in which two pyrimidines are 
completely labeled, have also been reported by the same authors. Large fragments using 
other unusual base analogs such as 7-(2-oxoethyl)guanine, 2'-amino-2'-deoxycytidine, 8- 
oxopurine, and N-4-aminodeoxycytidine have been synthesized as well (Barbin et al., 1985; 
Purmal et al., 1994; Aurup et al., 1994; Negishi et al, 1988). 

Nick translation takes advantage of the ability of Escherichia coli DNA polymerase I 
to combine the sequential addition of nucleotide residues to the 3'-hydroxyl terminus of a 
nick [generated by pancreatic deoxyribonuclease (Dnase) I] with the elimination of 
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nucleotides from the adjacent S'-phosphoryl terminus (Meinkoth & Wahl, 1987; Rigby et al., 
1977; Langer et al, 1981; Holtke et al., 1990). Many modified bases have been incorporated 
by nick translation (Meffert and Dose, 1988; Gebeyehu et al., 1987; Gillam and Tener, 1986). 
DNA polymerase I causes breaks to be filled as rapidly as they are formed, incorporating the 
desired nucleotides into the original strands. Since a quantity of labeled nucleoside 
triphosphates are present during the reaction, the labels get incorporated and the parent 
strands are modified. Nick translation involves a mixture of the double-stranded DNA target, 
1 ul of DNAse I at a concentration of 2 ng/ml, 1 al each of three types of unmodified 
deoxynucleoside triphosphates (dNTPs at 100 uM concentration), 1 ul of a labeled dNTP (at 
300 uM), 32 ul water, and 1 ul of DNA polymerase containing 5-10 units of activity. 
Reaction proceeds for 1 h at 15°C. Reaction is quenched by 4 ul of 0.25 M EDTA, 2 ul of 10 
mg/ml fRNA, and 150 ul of 10 mM Tris, pH 7.5. Labeled DNA is purified by ethanol 
precipitation (Bethesda Research Laboratories, New England Nuclear, and Amersham). 

Direct PCR labeling not only incorporates the desired label into the target DNA, but 
also amplifies the amount of labeled DNA (Saiki et al., 1985,1988). PCR uses the heat-stable 
forms of DNA polymerase, most commonly, the Taq polymerase from Thermus aquaticus. 
Tag polymerase has the capability to incorporate labels into growing DNA copies with each 
cycle of PCR. In this manner, DNA fragments up to 3 kb can be labeled. Wiemann et al., 
1995 describe the method of internally labeling PCR products with fluorescein- 1 5 -dATP, a 
protocol that can be applied to other fluorophores. The reaction mixture for PCR 
incorporation of fluorophores consists of 1 uM of each primer, 200 uM of each dNTP, 10 uM 
of a fluorophore-dNTP, and 0.025 U/ul of Taq polymerase (Perkin-Elmer, Norwalk, CT). 
The reaction buffer consists of 10 mM Tris-HCl, pH 8.3, 50 mM KC1, 1.5 mM MgCl 2 , and 
0.01% (wt/vol) gelatin. PCR is performed using thirty cycles. 

Sequencing reactions generated by the Sanger method can also be analyzed with the 
aid of enzymatic labeling. Each chain-terminated fragment is labeled with a 5' and 3' 
fluorophore. The 5' fiuorophore is incorporated as part of the primer and the 3' end-label is 
added by terminal transferase. Terminal transferase labeling was originally developed using 
radiolabeled nucleoside triphosphates (Roychoudhury et al, 1979; Tu and Cohen, 1980). The 
technique was later applied to non-radioactive nucleotide analogs (Kumar et al., 1988). The 
technique involves addition of the target DNA to (a) 20 ul of 0.5 M potassium cacodylate, 5 
mM CoCl 2 , 1 mM DTT, pH 7, (b) lOOuM of a modified deoxynucleoside triphosphate, 4ul of 
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5 mM dCTP, and 100 pi of water. Terminal transferase is added to a final concentration of 
50 units in the reaction mixture and reacted for 45 minutes at 37°C (Hermanson, 1996). 

A post-labeling procedure can be used as well. In this method, a base analog with a 
linker arm for attaching a fluorophore is initially incorporated into the DNA using the 
5 methods described above. Subsequent attachment of fluorophores to the linker arms is 

possible using covalent linking techniques such as those described by Waggoner (1995). Jett 
et al. (1995) has demonstrated the full length synthesis of M13 DNA (7250 bp) with complete 
incorporation of one of the following: 5-(3-aminopropynyl)-dCTP, 5-(3-aminoallyi)-dCTP, 
or 5-(3-aminoallyl)-dUTP. Covalent fluorophore attachment to the linkers was also shown. 

10 The advantage of this technique is that the label need not consist of individual fluorescent 
molecules, but could be larger molecules with greater number of dyes. Such examples 
include phycobiliproteins, dye-filled beads, tagged proteins, or chains of fluorescent tags. 

Each of the foregoing patents, patent applications and references is herein 
incorporated by reference in its entirety. Having described the presently preferred 

15 embodiments in accordance with the present invention, it is believed that other modifications, 
variations and changes will be suggested to those skilled in the art in view of the teachings set 
forth herein. It is, therefore, to be understood that all such variations, modifications, and 
changes are believed to fall within the scope of the present invention as defined by the 
appended claims. 

20 What I claim is: 
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